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Das schénste Denkmal, das ein Mensch 
bekommen kann, steht in den Herzen seiner 
Mitmenschen.' 

Albert Schweitzer 


'The most beautiful memorial which can be erected for somebody is the imprint he leaves behind 
in the hearts of others. 


Wenn er aber studiert, miissen ihm bedeutende 
ideale Ziele vorschweben 


The headline above is a passage from a personal statement written by Wolfgang 
Schwarz shortly after finishing school, explaining his views and goals while looking 
forward to taking up studies at the University of Erlangen. Literally, it says that a 
student’s motivation must be substantial and idealistic. At that time it was customary 
to write a proposal for enrollment at a German university. 


Fig. 1 Left: the 17-year-old Wolfgang Schwarz; right: excerpt from his essay. Published with kind 
permission of © Doris Schwarz, 2013 


As a profession I aim at the position of a high school teacher of mathematics and physics, 
or, if possible, the career of a university professor of mathematics. 


This is another quotation from Wolfgang Schwarz’s essay showing a rather self- 
confident young man with ambitious goals. And indeed, this promising young man 
lived up to his own expectations and became an outstanding mathematician. His field 
of research was number theory with an emphasis on analysis. Yet, his interests were 
as widespread as his knowledge, ranging from elementary to algebraic aspects of 
number theory, and from analysis to algebra. This broad spectrum is well reflected 


?This is our translation of the German original (see Fig. 1). Please note that the German héheres 
Lehramt stands for teaching pupils from grade 6 to 13 at a gymnasium, and Hochschule at that 
time was equivalent to university. 
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in the variety of research articles and surveys in this memorial volume, some of 
them enriched with personal reminiscences. 

Wolfgang Schwarz was born on the 21st of April 1934 in Selb (not far from Hof 
in northern Bavaria), which is well known for its famous porcelain manufactories. 
His father was a teacher at a primary school and, although he died untimely 
in 1945 during World War II, he left a deep impression on young Wolfgang, 
which is best manifested in his early curiosity about mathematics and natural 
sciences. Their mother raised Wolfgang and his brother Werner and encouraged 
their musical talents. In 1951 Schwarz obtained his university entrance certificate, 
called Reifezeugnis in German. He enrolled at the Friedrich-Alexander-Universitat 
Erlangen to study mathematics and physics; among his teachers were Otto Haupt, 
Georg Nobeling, Wilhelm Specht, and, most importantly, the eminent Theodor 
Schneider, who is famous for his elegant solution of the seventh Hilbert problem on 
the transcendence of numbers of the form w’, where a is algebraic and f irrational. 
After passing the state examination for teachers in mathematics and physics in 
1956 Wolfgang Schwarz continued his studies as an assistant of Schneider, first 
in Erlangen and later at the Albert-Ludwigs-Universitét Freiburg (Fig.2). His 


Fig. 2. Wolfgang Schwarz during his doctorate studies. Published with kind permission of © Doris 
Schwarz, 2013 


doctorate thesis of 1959 dealt with questions about representations of integers as 
sums of powers of prime numbers. For his habilitation,+ accomplished in 1964 also 
at Freiburg, he studied so-called Tauberian theorems with applications to partitions. 


The topics and methods used in both of his theses form a central part of twentieth-century 
analytic number theory. In this memorial volume, honoring the life and work of Wolfgang 


3Independently, Alexander Gelfond provided another solution about the same time. 


Habilitation is a degree following the doctorate, at that time necessary to obtain the permission 
for independent teaching (venia legendi) and for appointment as a professor. 
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Schwarz, the reader will find several contributions closely related to Schwarz’s work, 
e.g. those by Jorg Briidern, Rainer Dietmann & Christian Elsholtz, and Rebecca Ulrike 
Jakob, all relying on the circle method, as well as Aleksandar Ivi¢’s article on Tauberian 
theorems. Some of these new results use or extend previous ones obtained by Schwarz. In 
the background of Janos Pintz’s article on recent progress in the theory of prime number 
distribution are sieve methods, which is another direction of research where Schwarz made 
significant contributions and published monographs at an early age. 


In the late 1960s Wolfgang Schwarz was appointed professor at the Albert- 
Ludwigs-Universitat Freiburg, and in 1969 he became Ordinarius at the Johann- 
Wolfgang-Goethe- Universitat Frankfurt. In 1974, he and his family settled in the 
village of Ruppertshain near K6nigstein in the lovely Taunus mountain range 
north of Frankfurt. The Schwarz family quickly became acquainted with its new 
surroundings and made many friends. 

Wolfgang Schwarz’s mathematical ideas and inventions were creative and 
innovative. His broad knowledge together with his mastery of advanced techniques 
was and still is very impressive. He always had a desire for simplicity and elegance. 
This is well reflected in his numerous (more than one hundred) research articles (see 
the complete list of his publications subsequent to this preface). A few of them deal 
with elementary, diophantine and transcendental questions. 


The articles of Régis de la Bret&che & Gérard Tenenbaum, Stefan Porubsky, Andrzej 
Schinzel, Jan-Christoph Schlage-Puchta, Tarlok Shorey & Rob Tijdeman, and Jiirgen 
Spilker are elementary in nature and lie at the core of number theory. Diophantine aspects or 
transcendence topics are the major theme of the contributions by Ioulia Baoulina & Pieter 
Moree, Valentin Blomer, Yann Bugeaud, Peter Bundschuh & Keijo Vaananen, Carsten 
Elsner, Manfred Madritsch & Robert Tichy, and Jorn Steuding; some of them are directly 
related to results of Schwarz. 


One of Wolfgang Schwarz’s major research interests was the study of arithmeti- 
cal functions. Together with his colleague and friend Jiirgen Spilker from Freiburg, 
Schwarz published the standard reference in this field, continuing important works 
by Wintner, Delange, Wirsing, Elliott, and Halasz. Methods of probabilistic and 
functional-analytic flavor form an essential part of Schwarz & Spilker’s Arith- 
metical Functions and provide a rather new perspective on this classical topic. 
In particular, the concept of Ramanujan expansions, studies of so-called related 
functions, and Gelfand’s theory of maximal ideal spaces make reading this book 
a great pleasure. 


In this memorial volume several examples of recent and past work on arithmetical functions 
can be found in the articles of Peter Elliott, Karl-Heinz Indlekofer, Aleksandar Ivic, Lutz 
Lucht, Eugenijus Manstavitius, and Friedemann Tuttas, some of them directly related 
to Wolfgang Schwarz’s work: the contribution by Aleksandar Ivi¢ includes aspects of 
Schwarz’s study of the value-distribution of certain arithmetical functions; the surveys 
of Lutz Lucht and Karl-Heinz Indlekofer discuss, among other topics, questions about 
inversion of multiplicative functions, resp. integration theory for arithmetical functions. 


Wolfgang Schwarz has always been interested in establishing international con- 
tacts. He pursued this objective as one of the main organizers of the number theory 
meetings at Oberwolfach for two decades from 1974 until 1994 (together with 
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Fig. 3. Wolfgang Schwarz and his first doctoral student Karl-Heinz Indlekofer discussing the 
divisor function in 1993. Published with kind permission of © Karl-Heinz Indlekofer, 1993 


Hans-Egon Richert and Eduard Wirsing). The success and worldwide reputation 
of these workshops in the Black Forest emphasize the mathematical significance 
of both Wolfgang Schwarz and the group of German number theorists at this time. 
Another notable activity of Schwarz was the exchange program with the group of 
probabilists and number theorists around Jonas Kubilius at Vilnius University (even 
before Lithuania’s independence in 1990). 


In particular these Lithuanian contacts matched Schwarz’s research interests on probabilis- 
tic number theory and applications to arithmetical functions, well reflected in the article 
by Eugenijus Manstavicius. However, questions around the Riemann zeta-function and its 
relatives should be mentioned here as well. To illustrate this line of research one may list 
the contributions of Ramiinas GarunkStis & Justas Kalpokas, and Antanas Laurin¢ikas. The 
article by Helmut Maier & Michael Rassias falls into this category, too. Another related 
paper by Nicola Oswald & Jérn Steuding traces this direction of investigation in the research 
area of the famous mathematician Adolf Hurwitz who was born in Hildesheim. This article 
on the history of number theory reflects a further research interest of Wolfgang Schwarz. 


After Wolfgang Schwarz’s retirement in 2002, the history of mathematics and, 
in particular, the development of number theory, which he had always considered to 
play an important role in mathematical thinking and education, became the central 
topic of his mathematical activities. His extraordinary knowledge of mathematical 
and historical details often led to relevant and helpful remarks at conferences, more 
than once opening up a new point of view on a discussed unsolved problem or 


>The town that hosted the 2014 €£AZ conference where the first steps for this memorial volume 
were initiated. 
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Fig. 4 Wolfgang Schwarz playing the piano. Published with kind permission of © Doris Schwarz, 
2013 


a presented result. Wolfgang Schwarz’s other passion was classical music, which 
took an even larger role in his life as he was getting older. He played the piano 
to an almost professional level, and he composed his own pieces of music, which 
regularly have been performed by members of his family and also by students of the 
Hochschule fiir Musik und Darstellende Kunst Frankfurt (for example, to celebrate 
his 75th birthday). 


Fig. 5 Eva Schwarz and Valentin Blomer playing one of Wolfgang Schwarz’s pieces at the 
memorial colloquium during the 2014 €AZ conference. Published with kind permission of 
© Isa Lange, 2014 
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On Friday, the 19th of July 2013, Wolfgang Schwarz passed away at the age 
of 79. During the biannual conference Elementare und Analytische Zahlentheorie 
(EL AZ) hosted by the University of Hildesheim from July 26th to August Ist 2014, 
a special colloquium was held on Wednesday July 30th, dedicated to the memory 
of Wolfgang Schwarz. This was attended by his wife Doris and their daughter Eva, 
the conference delegates and several external guests. The idea to write a memorial 
volume to honor Schwarz’s life and work was born at about the same time. The 
series of EC AZ conferences had been initiated by Lutz Lucht in 2000 in order 
to provide a regular platform for young German number theorists to present their 
research and to meet with experts. Since then these workshops have continued 
to grow into an international conference with eighty participants from all over 
the world, invited morning lectures and parallel sessions in the afternoons. The 
program of the Memorial Colloquium at the Hildesheim €£AZ together with a 
group photograph can be found on the final pages of this volume. 


Besides Aleksandar Ivi¢’s opening lecture as well as the Wednesday colloquium talks given 
by Lutz Lucht, Karl-Heinz Indlekofer, and Jérn Steuding honoring Schwarz’s scientific 
work, this volume contains additional contributions presented at the €CAZ conference, 
some of them with personal reminiscences, for example, the one authored by Stefan 
Porubsky.° The photograph on the previous page shows Valentin Blomer and Schwarz’s 
daughter Eva playing one of Wolfgang Schwarz’s own compositions during the memorial 
colloquium. 


The third €€.4Z conference held in Mainz in 2004 was organized by Wolfgang 
Schwarz and this is just one of the many examples showing his dedication to 
support young researchers. Another one is his longstanding commitment to the 
Cusanuswerk at the University of Frankfurt as a student mentor, a position he was 
committed to for more than 30 years. It is noteworthy that this Catholic institution 
is named after the fifteenth-century universal scientist and mathematician Nicholas 
of Cusa. Over the years Schwarz and his Cusanuswerk students met twice a year at 
his home for an evening of culture and good food prepared by Doris. 


One of his students from the Cusanuswerk, Anne Henke, is now professor of mathematics 
and has contributed an article on her field of research, namely representation theory. This 
work as well as the articles by Samuel Patterson on variations of Gaussian sums, Jiirgen 
Sander & Torsten Sander on the interaction between number theory and (algebraic) graph 
theory, Eduard Wirsing on a classical function-theoretical result of Hardy, and Jiirgen 
Wolfart & Benjamin Miihlbauer on dessins d’enfants would most likely have met with 
Schwarz’s enjoyment in the linkage of different mathematical views. 


6The ‘Obituary for Wolfgang Schwarz’ by K.-H. Indlekofer, L.G. Lucht, and J. Steuding, published 
in Jahresber. Dtsch. Math.-Ver. 116, No. 3, 153-169 (2014) provides further information. 
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Yet, Wolfgang Schwarz was not only a researcher and teacher of mathematics, 
he also was well aware of the importance of activities beyond the blackboard. He 
engaged in university administration and was a member of the presidium of the 
Deutsche Mathematiker Vereinigung' (DMV) for several years, being its president 
in 1986/1987. In 1992/1993 he was chairman of the Konferenz der Mathematischen 
Fachbereiche® (KMathF). 


Arfaborwe, fr Menark weeoler, unt ih 
gfautke, das clan Uninenibitastidine clo 
rfodwumg , hie er rich trft im Leben 
crurbus wire , morurgrthmin Marin, 
Des Buctium Bit cen Menshen reife, 
Weterwmomenchiinrge we oe sams clien 


Fig. 6 Another quotation from Schwarz’s essay of 1951. Published with kind permission of 
© Doris Schwarz, 2013 


We conclude with a final quotation from Wolfgang Schwarz’s essay for university 
entry: 

Furthermore I wish to become an experienced and mature man. I believe that by studying at 

a university, a young person is able to anticipate much of the experience he would otherwise 

have to accumulate later in life. By studying, humans gain maturity, it enables one to view 

the great and the sublime, but also the small; it provides us with the opportunity to see the 

big picture [... ]. 


In our opinion, these words do not only show a young man’s perspective towards 
his future life, but they also reflect the lifelong motto of a great character who 
had so much empathy and was so supportive of his colleagues and friends. The 
mathematical community has lost an excellent and renowned number theorist and 


7 Association of German Mathematicians. 
8Literally the ‘Conference of the mathematical faculties and departments’ in Germany; there seems 
to be no English name. 
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a true friend. We will miss him while doing mathematics, making music, when 
playing chess or hiking, or simply in daily life. 


Hildesheim Jiirgen Sander 
Wiirzburg Jérn and Rasa Steuding 
November 2015 


Post Scriptum: We are grateful to Doris Schwarz for providing us with biographical 
details, her husband’s essay quoted above and several photographs, and to his 
daughter Karin Schwarz for proofreading. Further photographs in this volume have 
been contributed by friends and colleagues. We admire the artistic talent of Nicola 
Oswald, who created the watercoloured portrait of Wolfgang Schwarz on the front 
cover. Moreover, we thank Lutz Lucht for additional information as well as Karl 
Dilcher and Jan-Hendrik de Wiljes for technical support. Last but not least, we 
wish to express our gratitude to Mario Aigner, Sonja Gasser and the whole team 
at Springer Basel and Heidelberg for making our idea into a book. 


XV 


Wenn er aber studiert, mtissen ihm bedeutende ideale Ziele vorschweben 


PLOT ‘Jepues uosin¢ © Jo UOTssIUIod pury YIIM poyst[qng “WIsYysep[IH Ul aoUAIIJUOD ZY7? oy Jo J0)so0d oy, 


Wolfgang Schwarz’s Publications 


15. 
16. 


. Zur Darstellung von Zahlen durch Summen von Primzahlpotenzen. I. Darstel- 


lung hinreichend grosser Zahlen. J. Reine Angew. Math. 205 (1960/1961), 
21-47. 


. Zur Darstellung von Zahlen durch Summen von Primzahlpotenzen. II. J. Reine 


Angew. Math. 206 (1961), 78-112. 


. Weitere, mit einer Methode von Erdés-Prachar erzielte Ergebnisse. Math. 


Nachr. 23 (1961), 327-348. 


. Uber die Summe >>, <x ?(f(n)) und verwandte Probleme. Monatsh. Math. 66 


(1962), 43-54. 


. Irrationale Potenzreihen. Arch. Math. 13 (1962), 228-240. 
. Zur Darstellung einer Zahl als Summe einer k-ten Potenz einer potenzfreien 


Zahl und einer g-ten potenzfreien Zahl. Arch. Math. 14 (1963), 105-115. 


. Eine Bemerkung tiber fast-ganzwertige Funktionen. Monatsh. Math. 67 


(1963), 129-136. 


. Uber die Lésbarkeit gewisser Ungleichungen durch Primzahlen. J. Reine 


Angew. Math. 212 (1963), 150-157. 


. Weitere, mit einer Methode von Erdés-Prachar erzielte Ergebnisse. II. 


Monatsh. Math. 68 (1964), 75-80. 


. Einige Anwendungen Tauberscher Sdtze in der Zahlentheorie. Habilitationss- 


chrift, Universitat zu Freiburg i. Br., 1964. 


. Einige Anwendungen Tauberscher Satze in der Zahlentheorie. A. J. Reine 


Angew. Math. 219 (1965), 67-96. 
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. Some applications of Elliott’s mean-value theorem. J. Reine Angew. Math. 
307/308 (1979), 418-423. 

. with Spilker, J.: Wiener-Lévy-Satze fiir absolut konvergente Reihen. Arch. 
Math. 32 (1979), 267-275. 

. with Ivi¢é, A.: Remarks on some number-theoretical functional equations. 
Aequationes Math. 20 (1980), 80-89. 

. with Duttlinger, J.: Uber die Verteilung der pythagoriiischen Dreiecke. Colloq. 
Math. 43 (1980), 365-372. 

. with Kopetzky, H.G., Wirsing, E.: Nonnegative sums of roots of unity. Arch. 

Math. 34 (1980), 114-121. 

Fourier-Ramanujan-Entwicklungen zahlen-theoretischer Funktionen und 
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1981, 399-415. 

with Knopfmacher, J.: Binomial expected values of arithmetical functions. J. 
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with Spilker, J.: Remarks on Elliott’s theorem on mean-values of multiplica- 

tive functions. In: Recent Progress in Analytic Number Theory, Durham 1979, 
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with Spilker, J.: Eine Bemerkung zur Charakterisierung der fast-periodischen 

multiplikativen zahlentheoretischen Funktionen mit von Null verschiedenem 

Mittelwert. Analysis 3 (1983), 205-216. 

with Heppner, E.: Benachbarte multiplikative Funktionen. Studies in Pure 

Mathematics, 323-336, Birkhauser, Basel, 1983. 

with Waterhouse, W.C.: The asymptotic density of supersingular Fermat 

varieties. Arch. Math. 43 (1984), 142-144. 
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Mathematicians in general are sociable people. They meet at conferences and 
discuss their latest results and much more at conference dinners. 


Published with kind permission of © Stefan Porubsky, 2004 


The picture above shows Wolfgang Schwarz hosting a number theoretical meeting 
in Frankfurt in summer 2004, surrounded by (from left to right) Aleksandar Ivi¢, 
Antanas Laurin¢cikas, Jorn Steuding, Stefan Porubsky, and Jiirgen Wolfart. 
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Below a scene from a private round at Schwarz’s home in the Taunus village 
Ruppertshain with guests. From left to right: Aleksandar Ivicé, Andrzej Schinzel, 
Stefan Porubsky, Janos Pintz, Wolfgang Schwarz, and Antanas Laurin¢ikas. 


Published with kind permission of © Stefan Porubsky, 1989 


The photograph above shows Wolfgang Schwarz discussing mathematics with Karl 
Dilcher. In the next one, Rasa Steuding, Karl-Heinz Indlekofer, Kohji Matsumoto, 
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Wolfgang Schwarz, and Eugenijus Manstavicius (from left to right) having dinner 
in a Japanese restaurant. 


Published with kind permission of © Jorn Steuding, 2005 


The following photograph shows Wolfgang Schwarz during the 2012 E€AZ 
Conference at Schlo8 Schney talking with Lutz Lucht and his wife. 


Published with kind permission of © Stefan Porubsky, 2012 
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Professer Schwarz discussing mathematics with his colleagues Profs. Albrecht 
Pfister (left) and Jiirgen Spilker (right). 


Published with kind permission of © Jiirgen Wolfart, 2009 


Below another example, Wolfgang Schwarz in discussion with Eugenijus 
Manstavicius; in the background Jiirgen Sander (on the left) and Eduard Wirsing 
(in the middle). 


Published with kind permission of © Stefan Porubsky, 2012 


But sometimes mathematicians are silently watching someone else’s talk. The 
next photograph shows Giinter Pickert, Eduard Wirsing, Wolfgang Schwarz, and 
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Karl-Heinz Indlekofer (from left to right). 


Published with kind permission of © Jiirgen Wolfart, 2009 


Some mathematicians are genuine globetrotters. Wolfgang Schwarz was one of 
those, and he was a passionate alpinist and mountaineer (see the photographs in 
Lutz Lucht’s contribution). 


Published with kind permission of © Peter Elliott, 1975 
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The last - rather romantic - photograph shows him at the Glaswaldsee in the Black 
Forest, while the photograph below, showing Wolfgang and Doris Schwarz, was 
taken during a rainy day at the Curonian Spit in Lithuania. 


Published with kind permission of © Jorn Steuding, 2011 


The Schwarz family has always been very hospitable. Below a photo showing 
the beginning of an excursion of a larger group of tourists in the Taunus range 2004. 


From left to right: Wolfgang Schwarz, Jérn Steuding, Andrzej Schinzel, Doris 
Schwarz, Janos Pintz, Aleksandar Ivi¢, and Antanas Laurincikas. Published with 
kind permission of © Stefan Porubsky, 2004 
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At conferences mathematicians give and listen to talks. 


Published with kind permission of © Jiirgen Wolfart, 2009 


Eduard Wirsing (on the left) and Wolfgang Schwarz enjoying a humoristic 
moment at the €£.4Z Conference 2012 in Schlo& Schney. 
Sometimes there are moments for celebration. At the €24AZ Conference 2004 


Published with kind permission of © Stefan Porubsky, 2004 


in Mainz Wolfgang Schwarz was celebrating his 70th birthday. As a representative 
of the Lithuanian delegation Eugenijus Manstavicius is handing over a hand-made 
Lithuanian sash. 
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Group photographs are another favourite amusement of mathematicians during 
their meetings. 


Published with kind permission of © Peter Elliott, 1984 


These pictures show Wolfgang Schwarz, Peter Elliott, Dieter Wolke, Ernst 
Heppner, Karl-Heinz Indlekofer, and Imre Katai (above from left to right) as well 
as Gérald Tenenbaum, once again Schwarz, Richard Warlimont, and Lutz Lucht 
(below from left to right) at a number theory conference at the Mathematisches 
Forschungsinstitut Oberwolfach in 1984. 


Published with kind permission of © Peter Elliott, 1984 
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Published with kind permission of © Stefan Porubsky, 2012 


Wolfgang Schwarz answering questions after his talk at the €24AZ Conference 
2012 in Schlo& Schney. In the background chairman Jiirgen Sander. 

At such occasions one can sometimes meet other mathematician’s families. 
Wolfgang Schwarz and part of the Porubsky family having coffee and ice cream. 
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Forbidden Integer Ratios of Consecutive Power 
Sums 


Ioulia N. Baoulina and Pieter Moree 


To the memory of Prof. Wolfgang Schwarz 


Abstract Let S,(m) := 1* + 2" +--.+ (m—1)* denote a power sum. In 2011 
Bernd Kellner formulated the conjecture that for m > 4 the ratio S,(m + 1)/S;(m) 
of two consecutive power sums is never an integer. We will develop some techniques 
that allow one to exclude many integers p as a ratio and combine them to exclude 
the integers 3 < p < 1501 and, assuming a conjecture on irregular primes to be 
true, a set of density 1 of ratios p. To exclude a ratio p one has to show that the 
Erd6és—Moser type equation (o — 1)S;(m) = m* has no non-trivial solutions. 


Keywords Consecutive power sums * Erdés—Moser type equation 


2010 Mathematics subject classification: Primary 11D61; Secondary 11A07 


1 Introduction 


Power sums have fascinated mathematicians for centuries. In this paper we consider 
some Diophantine equations involving power sums, of which the Erdés—Moser 
equation 


1k 4 2% 4..-4 (m—2)* + (m— 1) = nt" (1.1) 
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is typical and the most famous one. This equation has the obvious solution (m, k) = 
(3, 1) and conjecturally no other solutions exist (this conjecture was formulated 
around 1950 by Paul Erdés in a letter to Leo Moser). Leo Moser [20], using only 
elementary number theory, established the following result. 


Theorem 1.1 (Leo Moser [20]) Jf (m,k) is a solution of (1.1) with k > 2, then 
m> 10!°. 


For the shortest proof of this result presently known, we refer to Moree [16]. 
Using very different techniques, namely continued fractions and a many decimal 
computation of log 2, Gallot et al. [6] established the current world record: 


Theorem 1.2 (Gallot et al. [6]) [fan integer pair (m,k) with k => 2 satisfies (1.1), 
then 


> 3.91392 10 Oe 


The bound m > 10!” seems feasible, but requires somewhat better computer 
resources than the authors of [6] had at their disposal. 

Let S,(m) := pears j* be the sum of the first m — 1 consecutive kth powers. In 
this notation we can rewrite (1.1) as 


Si(m) = m*, (1.2) 
In the literature also the generalized Erdés—Moser conjecture is considered. The 


strongest result to date is due to the second author [17] who proved the following. 


Theorem 1.3 For a fixed positive integer a, the equation S;(m) = am has no 
integer solutions (m, k) with 


k>2,  m<max(10"!",a- 107). 


Interestingly, the method of Gallot et al. allows one only to deal with a specific value 
of a, for a general a only the elementary method of Moser is available. 

Kellner [10] conjectured that if k and m are positive integers with m > 3, the ratio 
S;(m + 1)/S;(m) is an integer iff (m,k) € {(3, 1), (3, 3)}. Noting that $;(m + 1) = 
S;(m) + m*, one observes that this conjecture is equivalent with the following one. 


Conjecture 1.4 (Kellner—Erdés—Moser) Let m > 3. We have 
aS;,(m) = m* (1.3) 


iff (a,m,k) € {(1, 3, 1), (3,3, 3)}. 


If this conjecture holds true, then obviously so does the Erd6s—Moser conjecture. 
However, whereas the Erdés—Moser conjecture is open, we are able to establish the 
unsolvability of (1.3) for many integers a. 
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Theorem 1.5 Jf a has a regular prime as divisor or 2 < a < 1500, then the 
equation aS,(m) = m* has no solution with m > 4. 


(We consider 2 to be a regular prime, see Sect.2 for details.) The first restriction 
on a is not very difficult to prove, but powerful in its consequences. Assume that 
there exists a real number 5 < | such that the number of irregular primes p < x 
is bounded above by 6x/logx as x — oo. It then follows (see Sect. 2) that, for a 
set of integers a of density 1, aS,(m) = m* has no solution with m > 4. The first 
restriction implies that in order to exclude the a in the range 2 < a < 1500, one 
has to exclude a = 37? and all irregular primes in this interval. These a can be dealt 
with using various technical and not very general necessary conditions for (1.3) to 
be solvable. 

In case we are not able to exclude a square-free a, we are able to show that if (1.3) 
holds, then both & and m are large. 


Theorem 1.6 Suppose that aS,(m) = m‘, m => 4 and a is square-free, then both k 
and m exceed 3.44 - 10°. 


We would like to point out that for solutions with m = 1 (mod 3) orm = 1 
(mod 30) much larger lower bounds hold true (see Theorem 8.4). 

For a fixed integer a > 1 it is not known whether there are finitely many solutions 
(m, k) of (1.3) or not. In this direction we can only contribute the following modest 
result. 


Proposition 1.7 Let (m,,k,) and (mz,k2) be different solutions of aS,(m) = m, 


Then m, 4 mz and k, ¥ kp. 


A final approach for dealing with Eq. (1.3) is to try to prove that k is divisible 
by 120 say. Once established, there are many options on how to get an even bigger 
number to divide k. A cascade of ways how to proceed further arises and it seems 
very likely then that also in this case a + 1 cannot occur as a ratio. We demonstrate 
this cascade process in Sect. 7. Paul Tegelaar [24] jokingly called this ‘the method 
of infinite ascent’. 

After discussing some basic material on Bernoulli numbers and power sums 
in Sect.2, we obtain a crucial result (Theorem 3.11) relating Bernoulli numbers 
and solutions of the Kellner—Erdés—Moser equation in Sect. 3. Integers a that are 
a product of irregular primes cannot be immediately excluded, and for these one 
can use helpful pairs, see Sect.4. They allow one to rule out that k = c (mod d) 
for many even integers c > 2 and d. In Sect.5 we demonstrate with both an easy 
and a difficult example how to exclude a given integer ratio p. Table 3 in Appendix 
illustrates in a compact way how to show that 3 < p < 1501 are forbidden ratios. 
Some ratios are clearly much easier to exclude than others and this is discussed in 
Sect. 6. In Sect. 7 we discuss how to show that a given integer divides k. In Sect. 8 
we reason in the way of Moser to derive lower bounds for k and m in case a is 
squarefree. In Sect.9 proofs of the new results announced above are given. These 
are mainly based on work done in earlier sections. In the final section, Sect. 10, 
some further properties of potential solutions of the Kellner-Erdés—Moser equation 
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are derived. The proof that the integers 2 < a < 1500 are forbidden makes use of 
Tables 2 and 3 in Appendix. 

A survey of earlier work on Erdés—Moser type conjectures can be found in 
Moree [17], also see [3, Chap. 8], for an expository account of the work of Gallot 
et al. [6]. 


2 Preliminaries on Bernoulli Numbers and Power Sums 


Lemma 2.1 (Carlitz—von Staudt) Let k and m be positive integers, then 


m—1 m(m—1) j ‘ 

: 0 (mod —““—) ifk is odd, 

Sem) = DoF = : 
j=l 


= plm(p—lk » (mod m) otherwise. 


This result, with some small error (cf. Moree [14]), was published in 1961 by 
Carlitz. For an easy reproof of the above result, see Moree [16]. 
Recall that the Bernoulli numbers B, are defined by the power series 


They are rational numbers and can be written as By = U,/Vx, with Vi > O and 
gced(U,x, Vi) = 1. One has Bo = 1, By = —1/2, Bp = 1/6 and By+4, = 0 forj > 1. 

In the next four lemmas, we record some well-known facts about the Bernoulli 
numbers (see [7, Chap. 15]). 


Lemma 2.2 (von Staudt—Clausen) /fk > 2 is even, then Vz = I p-D|kP: 


Lemma 2.3 (Kummer Congruence) Let k > 2 be even and p be a prime with 
(p—1) +k. Ifk =r (mod p— 1), then B,/k = B,/r (mod p). 


Lemma 2.4 For any integers k => 1 andm > 2, 


k : 
k mit! 
Si(m) => > (jan. 


j=0 


Lemma 2.5 (Voronoi Congruence) Let k and m be positive integers, where m > 2 
and k > 2 is even, then V;S,(m) = Uym (mod m?). 


The following lemma gives a refinement of the Voronoi congruence (see [9, 
Proposition 8.5]). 
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Lemma 2.6 Let k and m be positive integers, where m > 2 andk > 6 is even. If 
m | Ux, then ViS;(m) = Ugm (mod m?). 


A prime p will be called regular if it does not divide any of the numerators U, 
with even r < p — 3, otherwise it is said to be irregular. The pairs (r, p) with p | U, 
and even r < p — 3 are called irregular pairs. At a first glance this looks like a 
strange definition, but by celebrated work of Kummer [11] can be reformulated as: 
a prime p is irregular if and only if it divides the class number of Q(¢,). The first 
few irregular primes are 37,59, 67, 101, 103, 131, 149,.... It is known that there 
are infinitely many irregular primes, cf. Carlitz [5]. It is not known whether there 
are infinitely many regular primes. Let z,(x) denote the number of irregular primes 
p < x. Recently Luca et al. [12, Theorem 1] showed that 


log log x 
(x) = (1+ 0o(1)) a, 
log log log x 


Conjecturally, cf. Siegel [23], and in good agreement with numerical work, we 
should have 


1 x 
nx) ~ (1— ,) mG) = 0.3985 5, 


Let N(x) denote the number of integers n < x that are composed only of irregular 
primes. If we assume that 


tet = Bes 1: (2.1) 
logx 


then by Moree [18, Theorem 1] we have N,(x) ~ cxlog®! x as x > 00, withca 
positive real constant. (Kummer conjectured that 6 = 1/2.) The latter result is of 
Wirsing type (cf. Schwarz and Spilker [22, pp. 65—76]). 

For more results on Bernoulli numbers see, e.g., the book by Arakawa et al. [1]. 


3 The Kellner—Erdés—Moser Conjecture 


In this section, we will use properties of Bernoulli numbers to study the non- 
trivial solutions of the equation aS;(m) = m*. This will then lead us to establish 
Theorem 3.11. As a bonus we will conclude that if aS;(m) = m* has non-trivial 
solutions, then a must be either 1 or a product of irregular primes. 

First assume that m = 2. Then a = 2*. Next assume that m = 3. Then we 
must have a(1 + 2") = 3* and hence a = 3° for some e < k. It follows that 1 + 
2 = 3*. This Diophantine equation was already solved by the famous medieval 
astronomer Levi ben Gerson (1288-1344), alias Leo Hebraeus, who showed that 8 
and 9 are the only consecutive integers in the sequence of powers of 2 and 3, see 
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Ribenboim [21, pp. 124-125]. This leads to the solutions (e, k) € {(0, 1), (1, 3)} and 
hence (a,m,k) € {(1,3, 1), (3,3, 3)}. Now assume that m > 4 and k is odd. Then 
by Lemma 2.1 we find that m(m — 1)/2 divides m*, which is impossible. We infer 
that, to establish Conjecture 1.4, it is enough to establish the following conjecture. 


Conjecture 3.1 The set 
A = {a> 1: .aS;(m) = m has a solution with m > 4 and k even} 


is empty. 


Lemma 3.2 Suppose that aS,(m) = m* with m > 4 and k even. If p is a prime 
dividing m, then (p — 1) ¢ k. 

Proof Assume that p | m and (p — 1) | k. Let p® || mand p! || a, where e > 1, 
f = 0. Using Lemma 2.1 we find that $,(m) = a SK(P*) = =F (mod p*). Hence 
aS;(m) = — (mod pot), and so m* = —< (mod pot). Since p**/—! || awe 
deduce that p°t/—! || m*, and thus f = (k — le + 1 > k. Thena > p* > 2* and 
aS;(m) > 2*(m— 1)* > m*, which contradicts the fact that aS,(m) = m*. 


Corollary 3.3 Suppose that aS;(m) = m* with m => 4 and k even, then we have 
gcd(m, 6) = 1. 


Corollary 3.4 If gcd(a, 6) 4 1, thena ¢ A. 


Lemma 3.5 Suppose that aS;(m) = m‘ with m => 4 and k even. Suppose that a has 
no prime divisor p satisfying p < 2°. Then a | mI4/s1-!, 


Proof Assume that a + ml*/s|-!, Since each prime divisor of a divides m, there 
exists a prime p such that p!*/s! | a. By assumption p > 2°. Then a > (2°)!*/s] > 24 
and a$;(m) > 2*(m—1)* > m*, which is a contradiction. 


On combining the latter lemma and Corollary 3.4 we obtain the following result. 
Corollary 3.6 Suppose that aS,(m) = m* with m => 4 and k even, then a | m*&—)/?, 
Lemma 3.7 Suppose that aS;(m) = m* with m > 4 and k even, then m | Ux. 


Proof Multiplying the Voronoi congruence by a and using the fact that aS;(m) = 
m*, we deduce that Vim‘ = Uzam (mod am’). Since m? | m*+?)/? and, by 
Corollary 3.6, a | m*~”/, we have am? | U,am, that is m | Ug. 


Corollary 3.8 Suppose that aS;(m) = m‘ with m > 4 and k even, then k > 10. 
Lemma 3.9 Suppose that aS;(m) = m* with m > 4 and k even, then m? | Ux. 


Proof By Corollary 3.8, k => 10. Using Lemma 2.6 instead of the Voronoi congru- 
ence and proceeding then by the same argument as in the proof of Lemma 3.7, we 
deduce that am? | U,am, and so m? | U,. 
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Since U; is square-free for any even k < 50, we have 
Corollary 3.10 Suppose that aS,(m) = m‘ with m > 4 and k even, then k > 50. 
In case a = | the next result is Lemma 10 of [19]. 


Theorem 3.11 Suppose that aS;(m) = m* with m = 4 and even k. Let p be a prime 
dividing m. Then 

(a) p is an irregular prime; 

(b) k 0,2, 4, 6,8, 10, 14 (mod p — 1); 

(c) ord, (By/k) > 2 ord, m > 2; 

(d) k=r (mod p — 1) for some irregular pair (r, p). 

Proof By Corollary 3.3 and Lemmas 3.2 and 3.7 we see that p > 5, (p—1) + kand 


p | Ux. If p + k, then ord, (B,/k) > 0 and p is irregular. Now assume that p | k, ie., 
ord, k = 1. In view of Corollary 3.8 we have k > 10. Further 


k _ 
k(k —1) ' k\_ omit 
S; =B ——— By By_;——. 
x(m) xm + r eo” + » ( ar 


Hence 
me! _ Be k= (k — It» mi? 
1B mn al 
= r ym +m es Ny eMsG+DI (3.1) 
By Lemma 2.2 we have ord, Vi—-2 < 1, and hence 
k-1 2 
ord, By-2m* | = 2 ord, m— ord, Vy_z = 1. (3.2) 
Further, forj = 4,5,..., k, 
mi? 1- 1 j 
ord, ( = J 2) ord, mPOA D 5 j- sD 
(j+ 1)! -—1 4 


where o,(j-+ 1) denotes the sum of the digits of j-+ 1 written in the base p. Therefore 


(k— I)! mi? 
my ai By; iGEDI > 2ord, m> 2. (3.3) 
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Note that 
mk/2 k DD ag 
ord, ae ee ord, m — ord, k > 5 pr* — ord, k 
> 22% kl _ ord, k > ord, k > 1. (3.4) 


Using Corollary 3.6 we obtain 


p@-D/2 
ord, > 0. (3.5) 
a 
It follows from (3.4) and (3.5) that 
“a m&-2/2— yyk/2 
ord, {| —— } = ord, -—]>1. (3.6) 
ak a k 


Combining (3.1)-(3.3) and (3.6), we deduce that ord,(B,/k) > 0, and so p is 
irregular. This completes the proof of part (a). Part (b) is a consequence of part (a), 
Lemma 3.2 and the Kummer congruence. In the case p + k, part (c) follows from 
Lemma 3.9. Now assume that p | k. By part (a), p is an irregular prime, and so 
p > 37 > 2°. On combining Lemma 3.5 with s = 5 and Corollary 3.10, (3.5) is 
sharpened to 


m&-2/2 
ord, > 2ord, m. 
a 


Combining the latter estimate with (3.4) gives 


me! 
ord, ( i ) > 2ord,m. (3.7) 
a 


Further, by part (b) and the von Staudt—Clausen theorem, ord, Vi_2 = 0. Combin- 
ing (3.1), (3.2), (3.3), and (3.7), we complete the proof of part (c). Part (d) is a direct 
consequence of part (c), the fact that (p — 1) + k, and the Kummer congruence. 


Corollary 3.12 Ifa has a regular prime divisor, thena ¢ A. 


Corollary 3.13 Let p; and pz be distinct irregular prime divisors of a. Assume that 


for every pair (r1,p1), (’2,P2) of irregular pairs we have gcd(p; — 1,p2 — 1) + 
(7) —12). Thena ¢ A. 


Example Suppose that 37 - 379 | a. If a € A, then aS,(m) = m* with m > 4 
and k even and both 37 and 379 must divide m. There is one irregular pair 


(32,37) corresponding to 37 and two irregular pairs (100,379) and (174, 379) 
corresponding to 379. By Theorem 3.11 (d), & must be a simultaneous solution 
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of the congruences k = 32 (mod 36) and k = 100 or 174 (mod 378), which is 
impossible as gcd(36, 378) = 18, 18 + (32 — 100) and 18 } (32 — 174). Hence 
adA. 


4 Helpful Pairs 


Helpful pairs will be used to show that certain ratios are forbidden (Sect.5) and 
to show that certain numbers have to divide k (Sect.7). In both cases one has to 
show that k is not in certain congruence classes. In order to show that a certain ratio 
is forbidden, we have to exclude all the congruence classes with an appropriate 
modulus. In order to show that a certain even number d divides k, we do this by 
excluding all the congruence classes 2i (mod d) for 1 < i < d/2. If p | ais an 
irregular prime, d = p — 1, then by Theorem 3.11 we immediately exclude many 
congruence classes. 

The exclusion of a congruence is achieved by a helpful pair and the procedure is 
described just after the proof of the crucial Lemma 4.4. 


Definition 4.1 For a positive integer a let us call a pair (t, q)a with q a prime and 
2<t<q-—3 evento be helpful if q + aandaS,(c) 4 c' (mod q) for every integer 
c satisfying 1 < c < q—1. Ifq is an irregular prime, we require in addition that 
(t,q) should not be an irregular pair. 


Proposition 4.2 Let q => 5 be a prime and a be a positive integer. Then (2, q)a is a 


helpful pair if and only if (Hees) =-1. 


Proof Note that By = 1/6 and hence (2,q) cannot be an irregular pair. Since 
Sx(c) = (2c? — 3c? + c)/6, we see that (2,q)q is a helpful pair if and only if 
q + aand a(2c? — 3c? +c) # 6c” (mod q) forc = 1,...,q—1, that is, if and only 
if a(2c? — 3c + 1) ¥ 6c (mod q) forc = 0,...,q—1,ie., if and only if we have 
Ce) _ (zeae) =| 

q q : 
Proposition 4.3 Let q => 7 be a prime with (+) = —l. Then (4, q)q-2 is a helpful 
pair. 
Proof From (+) = —1 we deduce that 6n? + 10n — 1 # O (mod q) forn = 
1,...,q—1. This implies that 30(q — 2)S4(c) — 30c* = —2c(6c* + 10c? — 1) #0 
(mod q) forc = 1,...,q—1. Since By = —1/30, (4, q) is not an irregular pair. 
Thus (4, q)q—2 is a helpful pair. 


Lemma 4.4 Let 2 < t < q—3 and q bea prime. If (t,q)a is a helpful pair and 
aS;(m) = m* with k even, then we have k  t (mod q— 1). 


Proof Assume that k = t (mod q — 1). By Theorem 3.11 (d) we must have g + m, 
for otherwise (f, q) is an irregular pair, contradicting the definition of a helpful pair. 
Thus we can write m = moqg + b with 1 < b < q—1. By Lemma 2.1 we have 
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q | S:(q). We now find, modulo q, S;,(m) = S,(m) = moS,(q) + S,(b) = S;,(b). Thus 
if (1.3) is satisfied we must have aS,(b) = b' (mod q). By the definition of a helpful 
pair this is impossible. 


Ruling Out Congruence Classes fork The helpful pairs give us a chance to rule out 
k that satisfy certain congruences of the form k = c (mod d) with c > 2 andd 
even. We first list all primes g => 5 such that q — 1 divides d. Denote these primes 
by qi,...,qs. Let t; denote the least nonnegative integer congruent to c modulo 
qi — 1. If one of the pairs (¢;, gi)q is helpful, by Lemma 4.4 we have ruled out k = c 
(mod d). If this does not work, we multiply d by an integer € > 2 (the lifting factor). 
Our original congruence is now replaced by £ congruences, k = c + jd (mod €d), 
0 <j < €. For each of these congruences we now continue as before. In certain 
cases we find that each of the lifted congruences is ruled out by a helpful pair and 
then we are done. This situation is described in Proposition 4.5 below. If not all of 
the lifted congruences are excluded by helpful pairs, we can lift the bad congruences 
still further. The above procedure is not systematic and each stage the danger lurks 
that we get too many congruence classes we cannot exclude anymore. 


Proposition 4.5 Let p be an irregular prime dividing a. Assume that for every 
irregular pair (r,p) there exists a positive integer ¢, such that for every j = 
0,1,...,, — 1 there is a helpful pair (t;,qj)a with (qj; — 1) | €(p — 1) and 
ti =r+j(p— 1) (mod gj — 1). Thena ¢ A. 


Proof Since p must divide m, Theorem 3.11 (d) yields k = r (mod p— 1) for some 
irregular pair (r, p). Hence there exists j € {0,1,...,&-—1} such thatk = r+j(p—1) 
(mod £,(p — 1)). Then we have k = 4 (mod q; — 1) for the helpful pair (4, qj)a, 
which contradicts Lemma 4.4. 


Corollary 4.6 Under the conditions of Proposition 4.5, we have ab ¢ A for any 
positive integer b = 1 (mod Q), where Q denotes the least common multiple 
of all components qj of helpful pairs constructed for all irregular pairs (r,p) 
corresponding to p. 


5 Excluding a Given Ratio p 


Let p => 3. Write a = p—1. If ahas no regular prime divisor, the only way we know 
to exclude p is by using helpful pairs or invoking Corollary 3.13. We demonstrate 
this with two examples (an example of the usage of Corollary 3.13 we already gave 
immediately following the statement of Corollary 3.13). 


Easy example: a= 673. 


There are two irregular pairs (408,673) and (502,673) corresponding to 673. 
Theorem 3.11 (d) yields k = 408 or 502 (mod 672). If k = 408 (mod 672), then 
k = 8 (mod 16). The latter is impossible since (8, 17)10 is a helpful pair by Table 2 
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in Appendix. If k = 502 (mod 672), then k = 2 (mod 4), which is impossible as 
(2, 5)3 is a helpful pair by Table 2 in Appendix. 


Difficult example: a= 653. 


There is one irregular pair (48,653) corresponding to 653, and sok = 48 
(mod 652). We have 652 = 2? - 163. There are no helpful pairs (48, q) with 
(q — 1) | 652. So we have to use a lifting factor @. It turns out that £ = 4 is a 
useful factor. So that is why we use it in the first step. 


Step 1. We have k = 48 or 700 or 1352 or 2004 (mod 2608). The case k = 
48 (mod 2608) is impossible, since (48, 2609)¢53 is a helpful pair. If k = 
700 or 1352 (mod 2608), then k = 8 or 12 (mod 16), which is impossible as 
(8, 17)7 and (12, 17); are helpful pairs. Thus k = 2004 (mod 2608). 

Step 2. We have k = 2004 or 4612 or 7220 or 9828 or 12436 (mod 13040). If 
k = 4612 or 7220 or 9828 (mod 13040), then k = 12 or 20 or 28 (mod 40). 
From the fact that (12, 41)3g, (20, 41)33 and (28,41)3g are helpful pairs we 
deduce that the latter congruence is impossible. Hence k = 2004 or 12436 
(mod 13040). 

Step 3. We have k = 2004 or 12436 or 15044 or 25476 or 28084 or 38516 
(mod 39120). If kK = 15044 or 28084 (mod 39120), then k = 4or14 
(mod 30), which is impossible as (4,31)2 and (14,31). are helpful pairs. If 
k = 2004 or 25476 (mod 39120), then k = 24 or 36 (mod 60). The latter is 
impossible, since (24, 61)43 and (36, 61)43 are helpful pairs. The case k = 12436 
(mod 39120) implies k = 196 (mod 240), which is impossible as (196, 241)17; 
is ahelpful pair. The remaining case k = 38516 (mod 39120) is also impossible, 
since in this case k = 9176 (mod 9780) and (9176, 9781)¢53 is a helpful pair. 


Remark Using helpful pairs, we can find some infinite families of forbidden ratios. 
For example, let p = 37° + 1 for some positive integer s. The prime 37 is 
irregular and (32,37) an irregular pair. If 37°S,;(m) = m* with k even then, by 
Theorem 3.11 (d), we have k = 32 (mod 36). This implies that k = 8 (mod 12). 
Since (8, 13)37. is a helpful pair if and only if 37° = 1 or 2 or6or8 or 11 
(mod 13) (see Table 2 in Appendix), we deduce that p = 37° + 1 is a forbidden 
ratio for any s = 0 or | or 7 or 9 or 11 (mod 12). 


6 Bad Ratios 


Table 3 in Appendix gives a list of ratios we excluded and the helpful pairs used to 
do so. The attentive reader will notice that various ratios p are apparently bad and 
difficult to exclude. These are related to a = p — | that are of the form (2p + 1)° 
with p a Sophie Germain prime. Recall that a prime p is said to be a Sophie Germain 
prime if also 2p + 1 is a prime. Heuristics suggests that there ought to be infinitely 
many Sophie Germain primes such that 2p + | is an irregular prime. 
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Conjecture 6.1 There are infinitely many primes p such that 2p + 1 is an irregular 
prime. 


Let p be a prime such that 2p + | is an irregular prime and let (r, 2p + 1) be an 
irregular pair. In case we want to rule out k = r (mod 2p) we are in bad shape to 
start with. We are directly forced here to use a lifting factor € > 2 (as the list of 
primes 5 < gq < 2p + 1 with (g — 1) | 2p is empty here). The next result shows that 
we are in even worse shape, since helpful pairs with primes gq = 2pu + | > 6p have 
to be used. 


Proposition 6.2 Let p be a prime such that 2p + 1 is an irregular prime dividing 
a and let (r,2p + 1) be an irregular pair. Let £ be a positive integer with p t £, let 
qo. 41,-+-.qe-1 be odd primes with (q; — 1) | 22 (not necessarily distinct) and let 
to, t,...,t¢-1 be positive integers satisfying the conditions t, = r + 2pj (mod qj — 
1),0 <j < €—1. Then at least one of the pairs (to, do)a, (ti, ia: +++» (te-1, Ge—1a 
is not a helpful pair. 


Proof Since r is even and p } &, there exists a j with 0 < j < €—1 such that 
pj = —r/2 (mod £). Hence 2¢ | (r + 2pj). This implies that (g; — 1) | t;, and so 
(t;, dja is not a helpful pair. 


In case we are not able to exclude such a bad ratio, we might try at least to show 
that the k of a solution has to be highly divisible. In the next section we demonstrate 
this for the bad ratio 6780. 


7 Divisibility of k 


In this section, we consider the case a = 6779 = 2- 3389+ 1 and show that for 
a non-trivial solution k is divisible by a large number. We will present a heuristic 
argument here why we think that for this a there are no solutions. We expect that a 
similar reasoning might work for other values of a as well, once one can establish 
that a smallish number like 120 divides k. 

We start by discussing a baby example. 


Proposition 7.1 


(a) Ifa =1 or 2 or3 (mod 5), then 4 | k. 

(b) Ifa = 1 or3 or 5 (mod 7), then 6 | k. 

(c) Ifa = 6 or7 (mod 11), then 10 |k. 

(d) Ifa =2 or 8 or 11 (mod 13), then 12 | k. 

(e) Ifa = 1 or 6 (mod 13), then 6 | k. 

(f) Ifa =1 or 5 (mod 11) anda = 15 (mod 31), then 10 | k. 


Proof If, e.g., a = 2 (mod 13), we see from Table 2 in Appendix that the pairs 
(2, 13)a, (4, 13)a, (6, 13)a, (8, 13)q and (10, 13), are all helpful. The final assertion 
follows from glancing at an extended version of Table 2 in Appendix. 
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Now let us consider a more serious example, with p = 6780 a bad ratio. 
Proposition 7.2 If 6779S,(m) = m*, then 2° . 3° -5*-7-11-13 | k. 


Proof We start with the congruence k = 3994 (mod 6778), which is a consequence 
of Theorem 3.11 (d) and the fact that there is only one irregular pair (3994, 6779) 
corresponding to 6779. 


Step1. We have k = 3994 or 10772 or 17550 (mod 20334). If k = 
3994 or 10772 (mod 20334), then k = 2 or 4 (mod 6), which is impossible as 
(2, 7)3 and (4, 7)3 are helpful pairs. Hence k = 17550 (mod 20334) and 2-3 | k. 

Step 2. We have k = 17550 or 37884 or 58218 (mod 61002). The case k = 
58218 (mod 61002) is impossible, since in this case k = 6 (mod 18) and 
(6, 19)15 is a helpful pair. Hence k = 17550 or 37884 (mod 61002). 

Step 3. We have k = 17550 or 37884 or 78552 or 98886 (mod 122004). If k = 
17550 or 98886 (mod 122004), then k = 18 or 30 (mod 36), which is impos- 
sible as (18, 37)g and (30, 37)g are helpful pairs. Hence k = 37884 or 78552 
(mod 122004). 

Step 4. We have k = 37884 or 78552 or 159888 or 200556 (mod 244008). If 
k = 37884 or 159888 or 200556 (mod 244008), then k = 12 or 36 or 48 
(mod 72), which is impossible as (12, 73)¢3, (36, 73)63 and (48, 73)¢3 are helpful 
pairs. Hence k = 78552 (mod 244008) and 2? - 3? | k. 

Step 5. We have k = 78552 or 322560 (mod 488016). In the case k = 78552 
(mod 488016) we have k = 8 (mod 16), which is impossible as (8, 17))3 is a 
helpful pair. Hence k = 322560 (mod 488016) and 2+ - 3? | k. 

Step 6. We have k = 322560 or 810576 (mod 976032). The case k = 810576 
(mod 976032) is impossible, since in this case k = 48 (mod 96) and (48, 97)g¢6 
is a helpful pair. Hence k = 322560 (mod 976032) and 2° - 3? | k. 

Step 7. We have k = 322560 or 1298592 (mod 1952064). In the case k = 
1298592 (mod 1952064) we have k = 288 (mod 576). The latter is impossible, 
since (288,577)432 is a helpful pair. Hence k = 322560 (mod 1952064) and 
OP, 3? | 

Step 8. We have k = 322560 or 2274624 or 4226688 (mod 5856192). The case 
k = 322560 (mod 5856192) is impossible, since in this case k = 288 
(mod 432) and (288, 433).g4 is a helpful pair. In the case k = 2274624 
(mod 5856192) we have k = 36 (mod 108), which is impossible as (36, 109). 
is a helpful pair. Hence k = 4226688 (mod 5856192) and 2° - 3° | k. 

Step9. We have k = 4226688 or 10082880 or 15939072 or 21795264 or 
27651456 (mod 29280960). If k = 15939072 or 27651456 (mod 29280960), 
then k = 6 or 12 (mod 30), which is impossible as (6,31); and (12,31) 
are helpful pairs. In the case k = 21795264 (mod 29280960) we have 
k = 24 (mod 60), which is impossible since (24,61)g is a helpful pair. The 
case k = 4226688 (mod 29280960) is also impossible, since in this case 
k = 108 (mod 180) and (108, 181)g9 is a helpful pair. Hence k = 10082880 
(mod 29280960) and 2° . 33 - 5 | k. 
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Step 10. We have k = 10082880 + 29280960; (mod 204966720) for some 
J € {0,1,...,6}. If 7 € {0,1,3, 6}, then k = 8 or 12 or 16 or 24 (mod 28), 
which is impossible as (8, 29), (12, 29), (16, 29)22 and (24, 29). are helpful 
pairs. If 7 = 4 or 5, then we have k = 6or 18 (mod 42). The latter is 
impossible, since (6, 43)2g and (18, 43)2s are helpful pairs. Hence k = 68644800 
(mod 204966720) and 2° - 37-5-7|k. 

Step 11. We have k = 68644800 + 204966720; (mod 1024833600) for some 
Jj € {0,1,2,3, 4}. If jj € {1,2,3}, then k = 20 or 40 or 60 (mod 100), which 
is impossible as (20, 101)12, (40, 101)j2 and (60, 101), are helpful pairs. The 
case j = 4 is also impossible, since in this case k = 3780 (mod 6300) and 
(3780, 6301)47g is a helpful pair. Hence k = 68644800 (mod 1024833600) and 
OA eee aa als 

Step 12. We have k = 68644800 + 10248336007 (mod 11273169600) for some 
J € {0,1,...,10}. If j € {0,2,3,6,8}, then k = 2 or 4 or 12 or 18 or 20 
(mod 22), which is impossible as (2, 23)17, (4, 23)17, (12, 23)17, (18, 23)17 and 
(20, 23)17 are helpful pairs. If j € {1,5, 7}, then k = 30 or 36 or 54 (mod 66), 
which is impossible since (30, 67) 12, (36, 67),2 and (54, 67); are helpful pairs. 
In the case j = 10 we have k = 16 (mod 88), which is impossible as 
(16, 89)15 is a helpful pair. The case 7 = 9 is also impossible, since in this case 
k = 160 (mod 352) and (160, 353)72 is a helpful pair. Hence k = 4167979200 
(mod 11273169600) and 2° - 33 .5*-7-11|k. 

Step 13. We have k = 4167979200 + 112731696007 (mod 146551204800) 
for some j € {0,1,...,12}. If 7 € {1,3,4,5,7,8,10}, then k = 
12 or 20 or 24 or 28 or 36 or 40 or 48 (mod 52), which is impossible as 
(12, 53)ag, (20, 53)ag, (24, 53)ag, (28, 53)ag, (36, 53)4g, (40, 53)4g and (48, 53)4g 
are helpful pairs. If 7 = 2 or 12, then k = 30 or 42 (mod 78), which is 
impossible, since (30,79)¢64 and (42,79)¢4 are helpful pairs. If 7 = O or 
6, then k = 60 or 110 (mod 130), which is impossible as (60, 131)og and 
(110, 131)og are helpful pairs. The case j = 9 is also impossible, since in this case 
k = 96 (mod 156) and (96, 157)28 is a helpful pair. Hence k = 128172844800 
(mod 146551204800) and 2° - 33. 5?-7-11-13|k. 


It seems that the type of argument used in the proof of Proposition 7.2 can be 
continued to deduce that more and more small prime factors must divide k. Given 
a prime g > 5 and 2 < t < g —3 even, one would heuristically expect that (t, q)a 
is helpful with probability (1 — 1/q)*~! which tends to 1/e, on assuming that the 
values S;(c) are randomly distributed modulo g. The numerical data obtained so far 
turn out to be consistent with this. 

For the original Erdé6s—Moser equation it is known (cf. [8, 19]) that NV | k with 


N = 28.3°.54.73. 117+ 137-17? - 19°. 23-+-997 > 5.7462 - 10%”. 
A heuristic argument can be given suggesting that if, say L, := Icm(1,2,...,v) 


divides k, with tremendously high likelihood we can infer that L,, divides k, where 
w is the smallest prime not dividing L,. It is already enough to have v > 11 here. To 
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deduce that k is divisible by say 24 might be delicate, but once one has L, | k say, 
there is an explosion of further helpful pairs one can use to establish divisibility of k 
by an even larger integer. To add the first prime w not dividing L,, one needs to have 
only a number of helpful pairs that is roughly linear in v, whereas an exponential 
number (in v) is available. However, the required computation time goes sharply up 
with increasing w. 

This result gives a lower bound of 10*?’ for k, which is modest in comparison 
with the lower bound obtained by Moser. However, as argued by Gallot et al. [6], 
a result of the form N | k leads to an expected lower bound m > 107°". For the 
Kellner—-Erd6s—Moser equation we likewise expect a result of the form N | k to lead 
to a lower bound for m that is exponential in N. 

Unfortunately, the authors are not aware of any systematic approach that would 
allow one to prove a result of the type that if aS,(m) = m*, then 120 | k, for every 
a > 1. Some preliminary work on this for the equation S,(m) = am* was done by 
the second author’s intern Muriel Lang in 2009. 


8 Lower Bound for m 


The aim of this section is to establish Theorem 8.4. The proof rests on Lemmas 8.1 
and 8.3. 


Lemma 8.1 Suppose that aS;(m) = m* with m => 4 and k even. Then m — 1 and 
2m—1 are square-free, and if p is a prime divisor of (n—1)(2m—1), then (p—1) | k. 


Proof Since 
Si(m— 1) = S(m) — (m— 1)* = S;(m) (mod m-— 1), 


Lemma 2.1 yields 


a) Ol: ih Gied aon= 1). (8.1) 


p\(m—1) 
(p—-D|k 


Note that if p | (m— 1) and (p— 1) + k, then p | m, a contradiction that shows 
that p | (m — 1) implies (p — 1) } k. If p? | (m — 1), it follows again that p | m, a 
contradiction that shows that m — 1 is square-free. 

Note that 


m—1 
Se(2m— 1) = Y°( + (2m— 1 —)') = 28,(m) (mod 2m — 1). 


j=l 
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Then, again by Lemma 2.1, 


a nT 2390 (asd dn), (8.2) 


p|(2m-1) 
(p-Dlk 


from which we deduce that 2m — 1 is square-free and each prime p dividing 2m — 1 
satisfies (p — 1) | k. 


Corollary 8.2. Suppose that aS,(m) = m* with m > 4 and k even, then m = 3 
(mod 4). 


Lemma 8.3 Suppose that aS;(m) = m* with m = 4 and k even and let p be a prime 
divisor of (m+1)(2m+1). If (p—1) | k then ord, ((m+1)(2m+1)) = ord,(a+1)+1, 
otherwise ord,((m + 1)(2m + 1)) < ord, (a+ 1). 


Proof Observe that aS;(m + 1) = (a+ 1)m*. Invoking Lemma 2.1, we obtain 


+1 
a y Glee (a+ 1)m* = 0 (mod a(m + 1)). (8.3) 
p\(m+1) 
(p-Dlk 


Since p | a implies p | m it follows that gcd(a,m + 1) = 1. Thus 
ord,(m + 1) = ord,(a+ 1) + 1 if p | (m+ 1) and (p—1)|k&, 
ord,(m + 1) < ord,(a + 1) ifp | (m+ 1) and(p—1) +k. 


Further, from 


aS; (2m+1) =a YoGk+ mt 1—j)*) = 2aS;,(m+1) = 2(a+1)m* (mod a(2m+ 1)) 
j=l 


we deduce that 


2 1 
ay = ep GA = 0 thd dm iy, (8.4) 
p\(2m+1) 
(p-D|k 
and so 
ord, (2m + 1) = ord,(a+ 1) +1 if p | (2m + 1) and (p— 1) | k, 
ord, (2m + 1) < ord, (a + 1) if p | (2m + 1) and (p—1) tk. 


Since m + | and 2m + 1 are coprime, the asserted result follows. 
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Part (g) below arose in collaboration with Jan Biithe and we only provide a sketch 
of the proof here. We remark that if the condition m = 1 (mod 30) is replaced by 


ae 
yo -4+->1, 
Aw? * 


(cf. Eq. (8.11)] the same conclusion holds true. 
Theorem 8.4 Assume that a > 1 is square-free and that aS;(m) = m* with m > 4 
and k even. Put ay = gcd(a+ 1,m-+ 1) and az = gcd(a+ 1,2m-+ 1). Put 


a (m? — 1)(4m? — 1) 
= 6a, a2 , 


M 


Then 


(a) m> a; 

(b) m— 1, 2m—1, (m+ 1)/ay, and (2m + 1)/az are all square-free; 

(c) if p divides at least one of the above four integers, then (p — 1) | k; 

(d) m > 3.4429. 10°; 

(e) the number M is square-free and has at least 139 prime factors; 

(f) ifm = 1 (mod 3), then m > 1.485-1072!!5 and the number M has at least 
4990906 prime factors; 

(g) ifm = 1 (mod 30), then m > 10*!0" and the number M has at least 
75760524354901799895 prime factors. 


Proof As ais square-free, we have a | m, and so m > a. If m = a, then (8.3) yields 


1 
y> = =0 (mod 1). 
p\(m+1) ? 
(p—I)|k 


Since the sum of reciprocals of distinct primes can never be a positive integer, we 
must have 


p\(m+1) 
(p-Dk 


which contradicts the fact that 2 | (m+ 1). Parts (b) and (c) are direct consequences 
of Lemmas 8.1 and 8.3. Further, using Lemma 8.3, parts (b) and (c) and the facts that 
a| mand gcd(a,m— 1) = ged(a, 2m— 1) = ged(a,m+ 1) = ged(a,2m+ 1) = 1, 
we find that 


m« = m (mod a(m— 1)), 


2mk = 4m (mod a(2m— 1)), 
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(a+ Im = a+1+4+ (a—1)(m+ 1) (mod a(m + 1)), 
2(at+ Im* = 2(a4+ 1) + (a— 2)(2m + 1) (mod a(2m + 1)). 


Here we will only provide details for the latter congruence, which is the most 
complicated one to establish. Since a and 2m + | are coprime, it suffices by the 
Chinese remainder theorem to establish the congruence modulo a and modulo 
2m + 1. Since a | m the congruence trivially holds modulo a. 

Now suppose that p | (2m + 1). 


First case: (p-—1)|k. 


By Lemma 8.3 we have ord, (2m + 1) = ord,(a+ 1) + 1 and it suffices to show that 
m* = 1 (mod p). This is true by Euler’s theorem. 


Second case: (p—l1)tk. 


Here we use that, by Lemma 8.3 again, ord,(2m + 1) < ord,(a + 1) to see that the 
congruence holds. 
We can rewrite the congruences (8.1)—(8.4) as 


1 m 
= 4-——__= = 0) (nod 1), (8.5) 
plm—1) ? age) 
1 4m 
yo + Cea a 0 (mod 1), (8.6) 
p\|Qm—1) P a ve 
eg ee ee 26 Gio), (8.7) 
pie P a(m +1) 
5 ee a 2 Gad i (8.8) 


py zat Pp a(2m + 1) 
ay 


By Corollary 8.2, the assumption that k is even and Lemma 8.3, we see that (m + 
1)/a; is even. Now noting that a > 37 (by Corollary 3.12 and the fact that 37 is the 
first irregular prime), we have 


a+1l+a(m+1) 1 


1 1 —1 1 
yy ee ee Pe a) ee 


1 
p a(m+ 1) = a(m+ 1) a 
pt 
a] 


Therefore, if we add the left-hand sides of (8.5)-(8.8), we get an integer, at least 5. 
No prime p > 3 can divide more than one of the integers m— 1, 2m—1, (m+ 1)/a,, 
and (2m + 1)/ap, and 2 and 3 divide precisely two of these integers. Hence M = 
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(m? — 1)(4m? — 1)/(6a)a2) is square-free and 


ye 1 re 2 at+1 2(a+ 1) 

m?P a(m—1)  a(2m—1) a(m+1)) aQm-+1) 

Pp 
>3 a wee (8.9) 
= oe Bg 


Since a | m,m > a > 37 and each prime divisor of m is irregular, we have m > 37. 
A simple computation shows that (8.5) is never satisfied for a > 37 and m = 37. 
Since 59 is the second irregular prime, it follows that m > 37-59. On noting that the 
four fractions above are decreasing functions in both a and m, we find on substituting 
a = 37 and m = 37-59 that paer ; > a, with a = 2.1657. Note that if 


a Ee a, (8.10) 


psx 


then m*/3 > M > Pe (note that a; > 2 and hence M < m*/3). One aes 
(using a computer algebra package) the largest prime p, such that ae £5 - < 22, 


with p1, p2,... the consecutive primes. Here one finds that s = 139 and 


139 


Y= < 2116566 <a < O~ 


j=l’ piM 


Thus 


1/4 
m> (3 I] r) > 3.4429- 10°. 


PSP139 
Now assume that m = 1 (mod 3). Then 3 | (2m + 1)/a, by Lemma 8.3, and so 


>> Dee eS 2 ee 
jaa P a(2m + 1) ae a(2m + 1) a/2 
P| 


Hence in this case, 22 in (8.9) can be replaced by 32. This @ occurs in the work 
of Moser and here it is known that s = 4990906, leading to m > 1.485 - 10%2!195 
(cf. [4, 16]). 

Finally, assume that m = 1 (mod 30). Then 


a ee eee (8.11) 
joe am—-1) 2 3 5 : : 
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and we have the inequality (8.9) with 22 replaced by 42. In this case the largest 


prime p, such that >> a ; < 4t can no longer be determined by direct computation 


and more sophisticated methods are needed, cf. [2]. 


Remark Note that in the proof we only used that a > 37. This has as a consequence 
that the proof only depends on the first assertion in Theorem 1.5. 


9 Proofs of the New Results Announced in the Introduction 


It remains to establish Theorem 1.5, Theorem 1.6 and Proposition 1.7. 


Proof of Theorem 1.5 The first restriction on a arises on invoking Corollary 3.12. 
In order to obtain the second restriction we have to write down all integers a < 
1500 that are composed only of irregular primes. These are listed in Table 3 in 
Appendix. Each of these can be excluded as is shown for two examples in Sect. 5. 
This exclusion process for each a can be reconstructed using Table 3 in Appendix. 


To prove Theorem 1.6 we need the following result, which shows that m and k 
are of comparable size. 


Lemma 9.1 Suppose that aS;(m) = m*. Thenk +1 < am < (a+ 1)(k+1). 
Proof We have 


S.(m) < | %ars Si(m+ 1)—1. 
1 


Hence 


Hs k+l _ k+1 
mt = asym) <a f stax =D 
k+1 k+1 
1 


and so am > k + 1. Further, 


(a + 1)m* = aS, (m + 1) > a(t + [#a) = 
1 


that is am < (a+ 1)(kK+ 1). 


Proof of Theorem 1.6 The lower bound on m is a consequence of Theorem 8.4, part 
(d). On invoking Lemma 9.1 with a > 1501 the result follows. 
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Proof of Proposition 1.7 Let (m, k) be a solution of (1.3). Observe that 


ae = Cre Cys (“—) - aed ny 


m m mit 
and so 
aS;(m) — m! > 0 if j <k, 
(9.1) 
aS;(m) —m <0 ifj >k. 


This shows that for every m, there is at most one k. 
Now assume that there exists a positive integer n such that aS,(m+n) = (m-+n)*. 
Then k > 1. Since 


m—1 k 
Se(m +n) = Sen) + Dn + jk = Sen) + mn + YO () n7ISi(m), 


J=0 j=l 


we have 


k k 
(m+n) = >. (') mink = aS;,(n) + amnk + a > (') nI§(m), 


j=0 j=l 
or, equivalently, 


k-1 
nk = aS;,(n) + amn* + a (') ni (aS;(m) —n). 


j=l 


In view of (9.1), the last equality cannot hold. Thus we see that, for every k, there is 
at most one m. 


Remark Using the same type of argument, we can prove the following: if (77, k;) 
and (mp, k) are two distinct solutions of aS,(m) = m*, then either m, > mz, k, > ky 
orm, < m), ky < kp. 


10 Other Properties of Kellner-Erdés—Moser Solutions 


There are many restrictions known that a solution of the Erd6s—Moser equation has 
to satisfy. We expect that most of these have an analogue for the Kellner—Erd6s— 
Moser equation as well. We present an example. 
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Theorem 10.1 Suppose that aS;(m) = m* with m > 4 and k even. We have 


= dk ifa=1(mod 4 
ati h-o)) OO a 
>4+ord.k ifa=3(mod 4). 


For a = | this was first established in Moree et al. [19, Lemma 12] using Bernoulli 
numbers. A much more elementary proof also dealing with solutions of the equation 
S;(m) = am* for integers a was given later by the second author [15]. 

Our proof of Theorem 10.1 makes use of the following lemma. 


Lemma 10.2 Let k and m be positive integers where k => 6 is even. Then 


[2] 4 92+ ords k (mod 23-+ord2 ky if [2] =? (mod 4), 


in 
2 
%| (mod 23+er2*) otherwise. 


[2 


Proof It is easily proved by induction on s that for an odd integer j and s > 1 


Si(m) = 


as _ )1 (mod 25+3) if; = +1 (mod 8), 
+241 (mod 2°+3) if j= £3 (mod 8). 


Note that k > 3 + ord. k. Indeed, for ord2 k = | and ord2 k = 2 it follows from the 
condition k > 6 and for ord) k > 3 we have k > 2%2* > 3 4+ ord k. Thus for an 
integer j we have 


0 (mod 234024) if j is even, 
i = 41 (mod 23424) ifj = +1 (mod 8), 
g2rordak + 1 (nod 23+") if j = +3 (mod 8). 


Assume that m = 1 (mod 4). Then 


-—1 
#{L <n<m : n= £1 (mod 8)} = #{1 <n <m : n= £3 (mod 8)} =. 
Hence 
m1 m=! rtordgk M1 itordek 
S,(m) = —— + —— (27% 4 1) = (gitonak 4. 1) 
4 4 2 
mt (mod 23tord2 ky ifm=1 (mod 8), 


i + 2?tord2k (mod 23+) ifm = 5 (mod 8). 
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Now assume that m = 3 (mod 4). Then 


rl 
#1 <n Sm: n= 41 (mod 8)} = #{1 Sn <m : n= 43(mod 8)} =~. 


This yields 
1 1 1 
Se(m + 1) — an + iia erat + 1) = m + (art 4 1) 
4 4 2 
_ \ 2 (mod 2340824) ifPm= > nbd 8, 
met 4 92+ ordg k (mod 93-+ordz ky ifm =3 (mod 8). 
Since 


jn 1 (mod g3+ord2 k) ifm=7 (mod 8), 
~~ ) 92+ord2 k +1 (mod 2340 id if m = 3 (mod 8), 


we deduce that 
pa eed 3-Fordy k 
Se(m) = Sk(m + 1) — mt = a (mod 2 2”), 
Finally, if m is even, then 


% (mod 23+0rd24) ifm # 4 (mod 8), 


m & g2-+ords k (mod 93+ ordy k) ifm=4 (mod 8). 


This completes the proof. 
Proof of Theorem 10.1 By Lemma 10.2 and Corollaries 3.8 and 8.2, 


=| 
S,(m) = — (mod 234982), 


From Lemma 8.3 we see that 


7(mod 8) if a=3 (mod 4), 
n= 
3(mod 8) ifa=1 (mod 4). 


Therefore 


aS;(m) =a- 


m—-1___, 41 (mod 23 torah) if a =3 (mod 4), 
=m = 
22tord2k +1 (mod 23+02*) if a =1 (mod 4), 
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and so 


0 (mod 244024) if a = 3 (mod 4), 
23+ ords k (mod 24+ ords ky tia= 1 (mod 4), 


a(m—1)-2= 


as desired. 


Theorem 10.3 Let a > 3. If aS,(m) = m* and mis a prime, then a = q*° for some 
irregular prime q = 3 (mod 16) and positive integer s. 


Proof Tf a has at least two distinct prime divisors, then m cannot be a prime. Now 
assume that a is a power of a prime gq. Then gq is an irregular prime and m = q. 
Suppose that a = q***! for some s > 0. Then (a + 1)m*/a(m + 1) = (qt! + 
1)S%(q)/(q + 1) is an integer, and (8.3) implies 


2 ‘20 (mod 1). 


P\(q+1) 
(p—-Dlk 


Exactly as in the proof of Theorem 8.4, we conclude that this is impossible. Finally, 
assume that a = q** for some s > 1. Then a = 1 (mod 8). Note that k has to 
be even. By Theorem 10.1 it follows that a(qg — 1) = 2 (mod 16), which yields 
i = 1 (mod 8), that is, g = 3 (mod 16). 


Remark It is not known whether there are infinitely many irregular primes g = 
3 (mod 16). However, from a result of Metsankyla [13] it follows that there are 
infinitely many irregular primes g = +3, +5 (mod 16). 
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Table 1 Pairs of irregular primes (p;,p2) with py < pz, pip2 < 50000, satisfying the conditions 
of Corollary 3.13 


P\ P2 

37 67, 103, 149, 157, 271, 307, 379, 401, 409, 421, 433, 463, 523, 541, 547, 
557, 577, 593, 607, 613, 631, 673, 727, 757, 811, 877, 881, 1061, 1117, 
1129, 1153, 1193, 1201, 1237, 1297, 1327 


59 233, 523 

67 103, 157, 271, 283, 409, 421, 433, 463, 541, 547, 613, 617, 619, 683, 691, 
727 

101 131, 149, 157, 271, 311, 401, 409, 421, 433, 461 

103 157, 283, 307, 463 

131 157, 271 

149 233, 257, 293 

157 233, 257, 271, 293, 307 


Table 2 Helpful pairs (tf, q)a with g < 17 


e 
~ 
— 


3, 7,9, 11, 12, 14, 15 
1, 2, 3, 5, 7, 8, 12, 14, 15 
3, 4, 6, 7, 9, 11, 12, 14, 15 


e 
~ 


q a (mod 4) 

5 1, 2,3 

7 1, 3,5 

7 1,3, 

1, 2, 3, 5, 6,7 

i 2, 3, 6, 7,9 

mn | 6. | L367 

1 6{| 8 (| 67,9 

13 1, 2, 5, 6, 8, 10, 11 

13 1, 2, 6, 8, 11 

13 | 6 | 2,3,4,5,7,8,9, 10,11 

13. | 8 | 1,2,6,8,11 

13 1, 2,4, 6, 7,8, 11 

17 1,2, 5,7, 8, 10, 13, 14 

17 1, 23, 5.8,9) Wy 12,15 

17 | 6 | 1,2,4,5,6,8 

17 | 8 | 2,3,4,5,6,7,9, 10, 11, 12, 13, 14, 15 
fa 


x 
an 
i 
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Table 3 Irregular pairs (r, p) along with the corresponding helpful pairs (4, qj)a(mod q;) Satisfying 
the conditions of Proposition 4.5 


a Helpful pairs (jain 
37 (8, 13) 

59 | ~~ (44,59) | 6s (2, 7)3, (4, 7)3, (6, 13)7, (276, 349)50 
67 (4, 13)2, (10, 13) 

101 | 68,101), | 6s (2, 7), (4, 7)s, (28, 41) 19, (168, 601) 101 


103 (24, 103) 14 | (2,5)3, (16, 29)16, (24, 43)17, (92, 239) 103, 
(194, 239) 193, (228, 239) 193, (636, 1429) 103, 
(840, 1429) 103 


131 (22, 131) (2,5)1, (12, 41)g, (152, 521) 131 


149 (130, 149) 15 | (2, 1)6, (4, Do, (6, 116, (8, 16, 2, 13)6, 
(10, 13)¢, (30, 61)27 


157 2.5) 

2.5) 
33 (26.59) 
251 (4.17); 


263 (100, 263) 30 (2,5)3, (2, 31)1s, (4, 31)is, (8, 3115, (14, 3115, 
(16, 31)15, (22, 31)15, (28, 31)15, (40, 61) 19, 
(624, 787) 263, (2196, 2621) 63, (2720, 3931) 263 


271 (4.11) 
283 (2.7)3 
293 (156, 293) 30 | (2, 11)7. (4, 11)7. (6, 11)7, (8, 11)7, (20, 41). 
mr hs (40, 61)49. (740, 877) 293. (1032, 1753) 293 
307 (88, 103)101 
311 (2, 11)3 
347 30 | (2, 5)2, (2, 16. (4, 116, (6, Lo. (8, 1D. 
(10, 31)6, (20, 61)42. (972, 1039)347 
353 2.5); 
800,353) |_| (2,7). (4, T)3. 12. 37)a0. (24, 37) 20. (36, 73)o1 
379 (4.71 
(48, 127)o9 
389 (4, 17)15, (8, 17)15. (12, 17)15. (976, 1553)3g9 
401 (2,5) 
409 (6, 19)19, (12, 19)19, (18, 37)2 
421 (20, 41) 11, (240, 281) 40 
433 (2,5)s 
461 (12, 47)38 
463 (4. 7)1 
467 (2, 7). (4, 7)5, (1026, 1399) 467 


(194, 467) 18 | (2, 5)o, (2, 7)s, (4, Ts, (12, 37)23, (24, 3723, 
(3456, 8389)467 


(continued) 
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Table 3 (continued) 


a Help pais Gatos 
wi @.11y 
(6.1) 
@.1y 
523 Ns 
54 0.5), 
547 (36.79) 
(2.5) (96.157 
557 2.5) 
571 (4.7) 


587 (90, 587) (2, 13)>. (4, 13)2, (6, 13)2, (8, 13)>. (10, 13)2, 
(92, 587) (2, 13)>. (4, 13)2, (6, 13)2, (8, 13)>. (10, 13)2, 


593 (2.5)3 
607 (4.7)5 
613 (2,5)3 
617 (20,29) 
@.5) 
0.5) 
619 (2.7)3 
631 @.) 
7) 
647 | ~~ (236,647) | 6 «| (2, 5), (2, 7)3, (4, 7)3, (84, 229) ig9 
Ee (2, 7)3, (4, 7)3, (72, 103)29 


ee | 647) (2,5), (2, 7)3. (4, 7)3, (180, 409) a3, 
(288, 457) 190 
653 (48, 653) (8, 17)7, (12, 17)7, (4, 31)2, (14, 31)o, (12, 41)38, 
Bost (20, 41)3g, (28, 41)3s, (24, 61)43, (36, 61 )as, 
(196, 241) 171, (48, 2609) 653, (9176, 9781)653 
659 Ee eee «| 659) (2::7)4,(4, Dis 6, 13)5, 12,19) 145 (42; 12 Doe, 


(882, 5923)es9 
on Se (17) 
2.5); 
677 (4, 13)1, (8, 13)1, (30, 79)as 


683 (32, 683) 12 | (2, 5)3, (32, 67)13, (76, 89)60, (280, 373)310, 
(2760, 4093)6g3 


or | aa6 | 1 [2,30 
000.691) [1 [2.0 

| am. | 1 (4.23) 

7st | 290.751) | 1 | (40,251)zu 


RPlRel/Rele 
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Table 3 (continued) 


a Irregular pair (r, p) Helpful pairs (4), 4))a(mod g;) 
757 (514, 757) (2,5)2 
761 (260, 761) (20, 41)o3 


713 (732, 773) (2, 7)3. (4, 7)3. (2. 11)3, (4, 11)3, (4, 17), 
(12, 17), (16, 41)35, (120, 241)50, 
(1118, 1931)773, (1504, 3089)773 
797 (220, 797) (8, 37) 20, (20, 37) 20, (220, 2389) 797, 
ae ey (1812, 2389)707, (2210, 3583)797 
809 (2, 116, (4, 16. (6, 116, (8. 116, (10, 41)30 
(2, 11), (4, 11)g, (6, 11)g, (8, 11)¢, (20, 41)39 


811 (544, 811) (4, 19)13 
821 (744, 821) (4,11); 


827 (102, 827) (2, 13)g, (4, 13)g, (6, 13), (8, 13), (10, 13), 
(2580, 4957) 397 


839 (66, 839) 1680 (2, 11)3, (4, 11), (6, 13)7, (10, 13)7, (8, 17)6, 
(2, 29)o7, (4, 29)a7, (8, 29)27, (10, 29)o7, 
(12, 29)o7, (14, 29)o7, (16, 29)a7, (18, 29)o7, 
(20, 29)27, (22, 29)o7, (10, 31)2, (20, 31)2, 
(10, 41)19, (16, 41)19, (20, 41)19, (26, 41)19, 
(28, 41) 19, (30, 41) 19, (38, 41) 19, (28, 6146, 
(48, 61) 46, (56, 61)46, (28, 71)58, (56, 71)ss, 
(4, 9763, (52, 97) 63, (76, 97)63, (26, 211) 206, 
(66, 211)206, (6, 281)277, (80, 281)277, 
(138, 281)o77, (248, 281)o77, (258, 281)o77, 
(220, 673) 166, (17664, 20113)g39 


877 (4, 13)6 
881 2.5) 
887 (2, 7)s, (4, 7)s, (2190, 2659)gg7 
929 (17) 
(4,17) 
953 (20, 137)131 


971 (166, 971) 42 (2, 7)s, (4, 7)s, (2, 29)14, (16, 29) 14, (18, 43)as, 
(24, 43)o5, (36, 43)a5, (26, 71)ag. (56, 71)ag, 
(6, 211)127 


1061 (2,5): 
1091 | = (888,1091) | 8s (2, 5)1, (4, 17)s, (8, 17)3, (12, 17), (8, 4D)2s 
117 Q.5) 
1129 (2, 11)7, (4, 11)7. (6, 11)7, (8, 11)7, (20, 41) 29 
nist | 36.151) | «(4.1 

(1) 

(8.1); 
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Table 3 (continued) 


a 
1153 
1193 
1201 
1217 


1229 


1237 
1279 
1283 


1291 


1297 


1301 
1307 


1319 


1327 
1367 
1369 
1381 
1409 
1429 
1439 


1483 
1499 


66,1217) | 
(iis, }(1118,1217), | 3 


(784, 1229) e 


(874, 1237) | 1 | 
(518, 1279) 
10 


(510, 1283) 


(304, 1319) 


| (234.1367) | 6 
| _358,1409) | 
(996, (9961429) | 


(574, 1439) 7 


(224, 1483) i a 


(94, 1499) 


Helpful pairs (¢;, g))a(mod qj) 
(2,5)3 
(2,5)3 
(4,17) 
(4, 13)g, (8, 13)g, (48, 97)s53 
(2,17)10 
(2, 13)g, (6, 13)g, (10, 13) 
(4, 17)s, (8, 17)s, (12, 17)s, (16, 29) 11, 
(32, 113)o9, (784, 8597) 1229, (2012, 8597) 1229, 
(3240, 8597) 1299, (5696, 8597) 1229, 
(6924, 8597) 1929 
(2, 5)2 
(2,7)s 
(2, 5)3, (2, 17, (4, 11)7, (6, 117, (8, 117, 
(6920, 12821) 1283 
(2,5)1, (56, 61)10 
(14, 31)20 
(2,5)2 
(12, 17)s 
(20, 53)29 
(2, 7)s5, (4, 7)s, (2994, 3919) 1307 
(2, 7)s, (4, 7)s, (852, 3919) 1307 
(2, 7)3, (4, 7)3, (8, 17)10, (6, 3117, 02, 3D17, 
(18, 31)17, (24, 3117, (6, 37)24, (12, 37) 24, 
(18, 37)24, (30, 73)s, (36, 73)s, (66, 73)s, 
(2940, 11863) 1319, (42480, 52721) 1319 
(2, 5)2, (4, 13)1 
(2, 5), (4, 13)a, (8, 13)2, (234, 4099) 1367 
(2, 11)s, (6, 11)5, (14, 31)s, (8, 41) 16, (20, 61)27 
(2,5)1 
(6, 23)6 
(44, 239)o34 
(4, 11)o, (8, 11)o, (2, 19)14, (4, 19)14, (8, 1914, 
(10, 19)14, 12, 19)14, (2, 3D)13, (6, 3113, 
(12, 31)13, (22, 31)13, (26, 31)13, (24, 37)33, 
(6, 43)20, (24, 43)20, (10, 61)36, (30, 61)36, 
(40, 61)36, (30, 71)19, (40, 71) 19, (36, 127)a2, 
(16, 181)172, (50, 181)172, (140, 181)172, 
(60, 211)173, (574, 8629) 1439, (50904, 60397) 1439 
(2, 13);, (8, 13); 
(2,7)1, (4, 7)1, (©, 13)4, (4, 17)s, (8, 17)s, 
(10, 17)3, (12, 17)3, (14, 17)3, (736, 857)642 
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A Note on the Negative Pell Equation 


Valentin Blomer 


To the memory of Wolfgang Schwarz 


Abstract Nagell conjectured in the 1930s that the set of discriminants for which 
the negative Pell equation has an integral solution has an explicitly given positive 
proportion within the set of discriminants having no prime factor congruent to 3 
modulo 4. In a series of papers, Fouvry and Kliiners succeeded in showing that 
the order of magnitude of such discriminants up to x is indeed x(log x)~'/*. Here 
we present a short independent argument that the order of magnitude is at least 
x(log x)~°-, 


Keywords Fundamental unit * Negative norm * Number of discriminants 
Pell equation 


2010 Mathematics subject classification: Primary 11N45; Secondary 11D09 


1 Introduction 


There are many open questions in connection with the solvability and the size of 
possible solutions of Pell-type equations 


x—dyv=C, x,yeZ, (1.1) 
for given squarefree d > 0. Let us write K := Q(/d) and 


D:= disc K/Q =f'd, f € {1,2}. 
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While (1.1) has always integral solutions for C = 1, only some d admit integral 
solutions to 


=07 Sa (1.2) 


This is tantamount to asking for which real quadratic discriminants D the fundamen- 
tal unit €p has negative norm, Nep = —1. Let be the set of such discriminants. It 
is well known that D € © if and only if the period length of the continued fraction 
of V/d is odd (which, however, might be as long as about /d). In more algebraic 
terms, (1.2) is solvable in x, y € Z if and only if the class group of K coincides with 
the narrow class group, in other words if the narrow Hilbert class field of K is real. 
It is, however, a hard problem to find necessary and sufficient criteria for D that are 
both efficient from an algorithmic point of view and simple enough for theoretical 
purposes, e.g. counting the number of D € D up to x. 

Let S be the set of integers having no prime divisor p = 3 (mod 4). By the 
Hasse principle we see that (1.2) admits rational solutions if and only if d € S, in 
particular 


x 


Jlog x 


On the other hand, by an observation of Redei the negative Pell equation does have 

integral solutions if d € S and in addition the two-part C[2] of the class group is 

isomorphic to (Z/2Z)°)—", that is, the class group C of K has no element of order 

4. For, in the latter case, the narrow 2-Hilbert class field coincides with the genus 

field and is in particular real. In other words, if D denotes the set of discriminants of 

real quadratic fields in S with no element of order 4 in the class group, then D C 9. 
On probabilistic grounds, one may expect 


#{D <x|D € D} < #{discriminants D <x|DeES}<« 


Xx 


Jlogx’ 


more precisely, there is theoretical and (some) empirical evidence [8] that 


HD <x|DeED}~x (1.3) 


#HD<x|DeED} -TT(1 I 


Fe oe ~ 0.581, x oo. 
#{discriminants D < x | D € S} 


~ 92j=1 


(1.4) 


Nagell [6] conjectured more than 80 years ago that the limit (1.4) exists in the open 
interval (0, 1), but no proof has been found so far. The beauty in this conjecture lies 
in the mix of analytic and algebraic number theory that is already apparent in the 
formulation of the problem. 
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A first step was made by Cremona and Odoni who proved ([1], see also [8]) 


#D <x|DeED,a(D) =n} sb 1 
a ee a Il 1- —— > XO we 
#{discriminants D < x | D€ S,w(D) =n} ft Deh 
(1.5) 
for each fixed n € N, so that in particular 
log] e 
HD <x|D€D} >, osloe" (1.6) 
logx 


In an impressive series of long and difficult papers, Fouvry and Kliiners [3-5] 
made great progress and essentially solved the problem (along with several related 
questions) by showing that the left-hand side of (1.4) is asymptotically bounded 
between two constants: 


foe) 


0.524 <2T] 1 1 Lei= #{D <x|DeD} 2 (1) 
| oi ae ea ae #{discriminants D < x | D € S} ~ 3 ee 
Our goal in this note is much more modest. Here we show that a short argument 
suffices to bring us half-way between (1.6) and the correct order of magnitude 
a(logx) 2, 


Theorem 1.1 Let ap = 0.616... be the minimum of the function 


1—alog2 
f(a) := 1—a{[ 1+ log ———— ]}. 
2a 
Then 
#{D <x|DeD} >, ————_— 
t <x| > (log x)%+¢ 


for any € > 0. 


The general approach is similar as in [1], but we modify the argument as follows: 
(a) we only aim at getting lower bounds for the quantity on the left-hand side 
of (1.5), (b) we use the large sieve to estimate primes in arithmetic progressions 
on average and (c) we exclude exceptional (Siegel) discriminants at an early stage 
of the argument. The idea of the method dates back several years, and Theorem 1.1 
is already mentioned in [4]. 
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2 Proof of the Theorem 


We start with the Siegel—Walfisz Theorem (e.g. [2, Sect. 20]). Let (a, q) = 1. There 
is a constant C > 0 such that 
Ole 0) (2.1) 


R= ~ o@) sal, ion log? 


dX x 
n=a (mod q) 


for all g < ev'°2*, except possibly for a thin sequence g; < qz < ... satisfying 
qjt1 > qj In particular, there is a set of rational primes P := {71,72,...} with 


7, > 22" ' such that (2.1) holds for all g < e¥'°* not divisible by any of the Tj. 
Our aim is to find a lower bound for the number of D € D which implies 
automatically a lower bound for the number of D € ®. To this end we utilize a 
criterion of Redei—Reichardt to characterize the 4-rank of the class group. Let 
={DEN|w*(D) = 1, p|D= (p= 1 (mod 4), p ¢ P)}, 
D* := {De D* | w(D) = n}, 


and let 


0 +1 
i= (tae |e = Ofer) = hap (1) 01 = 1<j = n)=}( ca 
0 0 


For any D = Te pi € DF let (ej) =: U(D) € LU, be given by ey := (2) for 


i <j. Then the 4-rank of the class group of Q(VD) is determined by U/(D) in the 
following way. Let U(D) := (nj) € F5*" be defined by (—1)"* = e, fori < j, 
Ny = Na and ny = Dixi nij- Then by a criterion of Redei—Reichardt [7] (see also 
[8]), 


4-rank of the class group of Q(WD) = n—1— tkp,U(D). (2.2) 


We shall need the following lemma [1, Proposition 2.5]: 


Lemma 2.1 Out of the 2(3) matrices U € UU, exactly 


n/2 
OTT (1-3 


=) 330) 


matrices give rise to a matrix U having rank n — | over Fo. 


A Note on the Negative Pell Equation 35 


For any VY € LU, let 
D* (x, V) := #{D € D® | D < x, U(D) = V}. 


The idea is to estimate this quantity from below, sum over all admissible V and then 
optimize the value of n. To this end, let us fix a small ¢ > 0, let C(e) be a sufficiently 
large constant depending only on ¢, and let 


fe n—1 
i= Cer, 


In the following, all implied constant may depend on e, but not on n. We shall show 
by induction: 


ny x (log log x — log log y)""! 


D*(x,V) >, 27 G 34 
ee logx 2"(n — 1)! a 


for any V € LL, and x > y, where 


- c(e) exp(j*) 
a, = | — ——— ] > 1 
i ( exp(c; ,/log -) 


for a suitable constant c(e) > 0. Here we choose C(e) large enough (in terms of 
c(€) and cj) so that each factor is positive. We will also choose n < log log x, so that 
Yn < exp(log C(e) - (log x)°’) for ¢ sufficiently small and the condition y, < x is 


not void. 
From Lemma 2.1, (2.2), (2.3) and the definition of D, we find 


= = _ n—1 
fe pips as 2. Cee Dee Ft) = alee Cle) 
logx 2"(n—1)! 


for n < log log x (and sufficiently large x). Choosing n = [a log log x] for some real 
0 <a <1, this is by Stirling’s formula 


x (- —alog2 e mmm 


= x(logx)f@ +o), 
5 . x(log x) 


logx 


with f(@) as in Theorem 1.1, and the proof follows. 

The rest of the paper is devoted to the proof of (2.3). This is certainly true for n = 
1 by the prime number theorem (or Chebyshev-type estimates) for the progression 1 
(mod 4). Suppose now (2.3) holds for some n € N, and let V = (€y)1<ijenti € Un+1 
be given. LetU/ := (€j)1<ij<n € LU, be obtained by deleting the last row and column 
from V. Let us consider some t = [|'_, p; € D*(x,U). We denote by A, the set of 
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residue classes a (mod 4/) satisfying 
= Pi ; 
a = | (mod 4) and (=) = Eintt, 1 <i<n. 
a 


By the Chinese remainder theorem we have 


p(4t) 


HA, = Qnt1 


If p satisfies the three conditions p = a (mod 4ft) for some a € A;, Pn < p < x/t, 
and p ¢ P, then pt € D%, ,(x, V). Let us assume x > yn41, and write 


y = x/2, 


Without loss of generality we also assume that x = [x] + 5 is a half-integer. Then 
we have 


* x 
DiV)= Yo # lpn <p $= |p = a(mod 4p ¢ P} 
t=] pi€D* (UU) GEA, 


~~ Yin (F:4.4) £0 (30 ee ats <x|a aa) 


t€D* (yU) ae A, tSy 


me > 2G)+(D DE 
(2.4) 


II 


II 


(x/t) 
z (7 as a) = ean 


teD¥* (yU) t<y aeA; 


+ O(x'), 


where )>” indicates that the summation is taken over f coprime to all z € P. By 
the prime number theorem and partial summation, the main term is 


IV 


1 i dt ( x/t 
— ey | a ee 
2" To, 2 logt dX exp(C2 /log(x/t)) 


(2.5) 


IV 


x [ D*(u, V) 4 +0( x ) 
— du ——___——_}: 
antl J, uw? log(x/u) exp(c3/log x) 
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note that y < x!/?, so that log + > log x. By the induction hypothesis (2.3) the main 
term here is 
a y = n—1 
. ,2-( a) x / (log log u — log log y,) du 
= 2"+1 log x 2"(n — 1)lulogu 


n 


(2.6) 


— 5 o-(4) x (log log y — log log y,)” 
7 2"+! log x 2"n! 


= 5-("$!) x (loglogx — log =~ — log log y,)” 
= log x 2tIn! 


It remains to estimate the first error term on the right-hand side of (2.4). This is 
a standard application of the large sieve. We split the t-sum in O(logx) dyadic 
segments. A typical of these terms is at most 


* * 1 
St oto (2.7) 


Z<t<2z a(mod 41) pSx/t pSx/t 
p=a (mod 4) 


for z < y. It is convenient to make the summation range independent of t¢ by 
replacing the condition p < x/t with p < x/z. This can be easily done by integral 
transforms. We use the formula (see, e.g., [2, p. 165]) 


[. oie sin) og _ 1+ O((T(B — |a|))~"), lal < B, 
st mg O(T(la|— By), jal > B 


for T, B > 0 with B = log X, a = log m getting 


T . 
_ ig Sin(é log X) 1 
an = i > aAnm —ar CO + O T ze lain| 


m<X =f m<Y 


m\-! 
od 
%y 


for real numbers Y > X and any sequence (a,,). In particular, (2.7) is at most 


T _fx« \ * 5 if 1 ig] 1 
DL | Lv gay Lvs min ( tows) at 
—T 2<1<22 a (mod 41) p<x/z $( t) pSx/z [5 
p=a (mod 41) 


1 * * pt\—! 1 pt|—} 
+O|5>, Ds ( Dd foes] + aay DL fees 
ZSt<2z a(mod 41) psx/z psx/z 


p=a (mod 41) 


(2.8) 
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for some 0 < T < x to be chosen later. Gluing together ¢ and p, the error term 
in (2.8) is at most 


x(log x)? 


1 q\7} 
7X40 [oe 
Si Ae ee | 


q2x 


; (2.9) 


here we need our assumption that x is not too close to an integer. The main term 
in (2.8) is by Cauchy’s inequality 


1/2 
zmin (= Gh Jog.) dé. 


We open the square and express each character modulo 4f in terms of its underlying 
primitive character modulo ¢ = 4t/€. Using O(£1’) = 6(£)d(7) we get 


1/2 
‘ zmin (a Gh Jog: dé. 


(2.10) 


Paes 
«[(¥2 Eilat EH E x0 


ZXt<2z a(mod 41) psx/z 


[, (Cam TG) zm "| xo 


mod ae 
(p,O=1 


We recall that the asterisk at the /’-sum denotes non-exceptional discriminants and 
the asterisk at the y-sum denotes primitivity. We split the ¢’-sum into two parts, 
> Pandt’ < P for some P < exp(./logx) to be chosen later. By the large sieve 
inequality (e.g. [2, p. 160]), the /’-sum restricted to > P can be bounded by 


«($+3)*(2): 


summing over £ and integrating over & we get 


x(log x)3/? 
JP 


since z < y = x"-®)/2, Let us now deal with the terms f < P. In this case, the 
innermost p-sum in (2.10) equals 


xO YS a. 


b (mod t’) psx/z 
p=b (mod 1’) 
(p.)=1 


+x!" logx (2.11) 
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By partial summation, and (2.1) where we use the fact that we have excluded 
exceptional characters, this double sum is 


< exp (-c’ ve *) (1 + |&)) 


for some absolute constant 0 < C’ < 1 (without loss of generality). Therefore the 
terms ¢/ < P contribute at most 


Cc’ | ) Cc 
P?/?x(T + log x) exp (-S log < « P?/?x(T + log x) exp (-< vy log *) 
Zz 
(2.12) 


1/2 


to (2.10) where we used again z < x’/~. Now we choose 


C C 
T := exp (5 Vives) , P:=exp (Sve) : 


10 


Collecting the error terms (2.9), (2.11) and (2.12), we see that (2.7) is 


Cc’ 
< x(log x)” exp (-5 Vives) ; 


and thus the total error in (2.4) and (2.5) is at most 


O (xex (- min (5: a) vive) ) 


For x > y,+1, the main term (2.6) is 


n _ 2)\n 
S97 =) x (log(1 + ¢—2¢é*)) 


x 2 
> 27+) ' 
logx 2rtIn! * logx 


Thus we can bound (2.4) from below by 


a) x (loglogx — log log yn41)" a exp((n + 1)) 
logx 2"+In! 7 exp (ci / log Yn+1) 


Q) 


n 


for x > Y,41 and some c; > 0. This is exactly (2.3) forn + 1. 
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when 2-8) > logy. 


Keywords Distribution of divisors * Integers with k prime factors 
2010 Mathematics subject classification: Primary 11N25 


Résumé A !’appui d’un principe général peu connu selon lequel la structure de la 
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1 Introduction 


La localisation de diviseurs dans des intervalles dyadiques pose un délicat probléme 
de crible: aucune relation de divisibilité ne lie les entiers d’un intervalle ]y, 2y] 
(y = 1), mais les ensembles de multiples de deux éléments quelconques ne sont 
généralement pas indépendants. 

Considérons la densité ¢, de ensemble .%, des entiers naturels ayant au moins 
un diviseur d tel que y < d < 2y. Besicovitch [3] a établi en 1934 que 


lim inf e, = 0 
yoo 7 


et en a déduit une remarquable réfutation de la conjecture en vigueur a l’époque 
selon laquelle tout ensemble de multiples admettrait une densité naturelle. 

Erdés [5] a montré en 1936 que ¢, — 0. Cependant, la détermination du véritable 
ordre de grandeur de ¢, a longtemps constitué un véritable défi pour les spécialistes. 
Précisant une estimation du second auteur [10], Ford [7] a montré en 2008 que l’ona 


1 


—— >3 
** Tog ilog,yz? 


ot l’on a posé 6 := 1 — (1 + log, 2)/log2 = 0.08607 et ot, ici et dans la suite, 
nous désignons par log, la k-iéme itérée de la fonction logarithme. Nous renvoyons 
le lecteur 4 l’ article de synthése [13] pour une mise en perspective plus complete de 
ce probléme et de ses ramifications. 

La valeur de l’exposant 5 peut étre devinée facilement : la croissance normale des 
facteurs premiers d’un entier n étant doublement exponentielle (voir par exemple [6] 
et le chapitre 1 de [8]), on peut approcher, en premiére analyse, le nombre des 
diviseurs de n n’excédant pas 2y par 2°»), ot w(n,y) désigne le nombre des 
facteurs premiers n’excédant pas y ; si l’on fait ’ hypothése que tous les intervalles 
dyadiques recoivent un quota statistique de diviseurs, on est amené a conjecturer que 
les entiers de % sont caractérisés par la condition 2°») >> log y. Or la densité de 
l’ensemble des entiers n tels que w(n, y) > (log, y)/ log 2+ O(1) vaut classiquement 
1/(log y)° +? — voir par exemple [14], ch. IL.6. 

Nous nous intéressons ici 4 un phénoméne particulier, connu des spécialistes 
mais qui, 4 notre connaissance, n’a jamais été clairement mis en évidence dans 
la littérature : la structure locale de la suite des facteurs premiers d’un entier est 
statistiquement gouvernée par son cardinal global. A Vappui de ce principe, et 
dans l’intention d’en fournir une illustration emblématique, nous nous proposons 
d’examiner la probabilité conditionnelle pour qu’un entier n ayant k facteurs 
premiers posséde un diviseur dans un intervalle ]y,2y] donné. Conformément a 
V’heuristique développée plus haut, la conclusion qualitative de nos résultats est que 
cette probabilité est minorée par une constante positive lorsque w(n) := (n,n) 
dépasse sensiblement (log, n)/ log 2 alors qu’elle tend vers 0 dans le cas contraire. 
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Désignons par &(x) l’ensemble des entiers n < x tels que w(n) = k, par m(x) 
son cardinal, et par H;(x, y) le nombre des entiers n de &(x) qui ont au moins un 
diviseur dans |y, 2y]. Nos résultats sont contenus dans les deux énoncés suivants. 


Théoréme 1.1 Soient R > 0, B > 1/log2. Il existe des constantes positives c, K, 
dépendant au plus de R et f, telles que l’on ait 

cmy(x) < Ay(x,y) < (A — c)ae(x), (1.1) 
uniformément pour 2 < y < x'/K, Blog, x < k < Rlog, x. De plus, l’inégalité de 


droite est valable pour 1 < k < Rlog, x. 


Cet énoncé induit deux questions naturelles, sur lesquelles nous espérons revenir 
dans un prochain travail : 


(a) est-il possible de remplacer, dans les hypothéses, la constante 6 par une quantité 
tendant vers 1/log2 ? 

(b) l’encadrement (1.1) est-il valable pour de plus grandes valeurs de k, par exemple 
Rlog, x < k < (logx)!*? 


Comme annoncé, la valeur 1/ log 2 est critique pour la minoration de (1.1). Dans 
toute la suite, nous utilisons les notations 


log x log u 
w= g@y:=,/—"" — ws y23) (1.2) 
logy (log, y) log, x 


Théoréme 1.2 Soit &(x) — oo. Sous les conditions 


l 
E(x)<y<JVx, kK {1 — aCe. y)}. (1.3) 
og 2 
nous avons uniformément 
Hy(x, y) = o(m(2)). (1.4) 


Remarques. 


(i) Nous avons 


1/V2 + o(1) x 


1 
g Ss 
log, x ‘ /log,y 


dans le domaine (1.3). 
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(ii) Des majorations effectives de H;(x, y) peuvent étre aisément déduites de la 
démonstration donnée au paragraphe 5. Nous nous sommes cantonnés ici a 
dégager le phénoméne qualitatif. 


2 Répartition des diviseurs des entiers de (x) 


Le résultat suivant fait ’ objet du lemme 1 de [12]. 


Lemme 2.1 Soient A > 0, B > 0, et f une fonction arithmétique multiplicative 
positive ou nulle telle que 


>of (p")logp” <Ax (x > 2), roe < <B. 


psx Pp v22 
Alors on a, uniformément pour k = 1, x = 2, 


1 


Yim «<< Ax (Sesf(ohiP + BY” 


GI (2.1) 


n€ E(x) 


ou la constante implicite est absolue. En particulier, pour toute constante R > 0, 
nous avons, uniformément sous la condition 1 < k < Rlog, x, 


Lo <K m,(x) exp {o 2 “a (2.2) 


avec 9 := (k—1)/ log, x et ot la constante implicite dépend au plus de A, B, R. 
o:= ( 3) Pp Pp P 


Dans toute la suite, nous nous donnons des paramétres x > 2, y € [2, /x], et 
Lé [1, log, y]. Soit 5, := expexp L (L > 0). Pour chaque entier n, nous écrivons 


=[*. 


pss, 
p' ||n 


Enfin, il est commode d’introduire la fonction Q : Rt — Rt définie par 
O(a) :=aloga—a+l. 


Notons que O(a) = 0 avec égalité si, et seulement si, a = 1. 


Lemme 2.2 SoientR > 0,x > 3et1<k < Rlog, x. Posant 


L; := min(L, log, x — L) (2.3) 
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et définissant implicitement x €]0, R] par k = k log, x, nous avons, uniformément 
pourl <L<log,x,1<é< JL, 


|{n € G(x) : |o(ny) — KL| > KEVL}| Ke m@e**, (2.4) 


ou l’on peut remplacer & /3 par &7/2 sié < i 


Démonstration. Supposons dans un premier temps que L < 5 log, x, et donc 
L, =L. Posons sj = 1 — €/./L), 52 = 1+ &/./L). Le cardinal 4 majorer n’excéde 
pas 


> ye K m(x) fe KOOVE ie e *O62)L} <K (aye /3, 


j 
1<j<2 n<x 
ou la premiére estimation découle du Lemme 2.1 et la seconde de |’ évaluation 
49° SQO(1+s)<4s°<O0-s) (O<s<)1). 
assertion relative au cas € < L!/° résulte du développement 
O(1 +s) = 45° + O(s°) 


au voisinage de s = 0. 


Lorsque L > ; log, x, et donc L; = log, x — L, nous observons que la condition 
sur @(nz,) apparaissant dans (2.4) équivaut a |w(n) — w(nz) — (k — KL)| > KE VL). 
On peut alors procéder similairement au cas précédent, en considérant les facteurs 
premiers de ]s,, x]. Oo 


Lemme 2.3 SoitR > 0,x = 3 et 1 < k < Rlog,x. On définit implicitement 
k €]0, R] par k = x log, x. Nous avons, uniformément pour | < L < log, x, T > 0, 


| {n € E(x) : logn, > Te’}| K mlxje*?. 


Démonstration. Nous pouvons supposer L assez grand puisque le résultat est trivial 
lorsque L est borné. Posons a := oo. Alors la fonction n +> nf est multiplicative 
et, puisque ./e < 2, satisfait les hypothéses du Lemme 2.1 pour des constantes 


absolues convenables A et B. La quantité 4 estimer n’excéde pas 


> {nz / exp (Te")}" =e 7? > ny. 


n€ Ex (x) ne E(x) 


Le Lemme 2.1 fournit aisément que la derniére somme est < m;(x). Lestimation 
souhaitée en résulte. Oo 
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Nous posons encore 


wy(m) := > 1 (m>1,040), 


plm, p<exp(1/|3|) 


o(n;s,t) := y 1 (0<s<)d), 


p\|n, s<logs pSt 
o(t):=t—/tlogt (t> 1). 
Le résultat suivant est énoncé, a fins d’ applications ultérieures, sous une forme 
significativement plus précise que nécessaire pour le présent travail. 
Lemme 2.4 Soient R > 0, ¢ €]0,5], x = 3, 1 <k < Rlog,x, 1 < L < log, x. On 


définit implicitement k €]0, R] par k = x log, x. Nous avons 


(nt) — 5 (nz) 


min 2.5 
I/(ceX)<|8|<2 o(L + log ||) 3 


pour tous les entiers n de &(x) sauf au plus « m,(x)/{log(1/e)}!/4. La constante 
implicite dépend au plus de R. 


Démonstration. Considérons un entier n contrevenant a (2.5). Il existe alors 
0 €]1/(ee”), 2] tel que, posant jy := |L + log |#||, et donc js > log(1/e) — 1, lon 
puisse écrire, pour tout v €]0, 1], 


l< por ar)—OlaL)—Ko G9) yok (G8) yo jo LE) 


Ainsi, le nombre R, des entiers exceptionnels satisfait 


R, < a con >, ees 


log(1/e)-1<j<L ne &x(x) 


pour tout choix de v; €]0, 1]. Nous pouvons employer Lemme 2.1 pour majorer la 
somme intérieure. Elle est 


< m(xje *O-w, 


Pour le choix v; := 1 — / (log j)/j, nous obtenons que le facteur sommé dépendant 
de j est < m(x)j~>/*. Il suit 


RK > BD ge o 


j>log(1/e)—1 Pe {log(1/e)}!/4 
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Soit Z(m) := Ualm (log d + [ _ 3, of). Nous désignons par A(m) la mesure de 
Lebesgue de (mm) et posons 


t(m,3):= )’d® (me N*, 9 ER). 
d\m 
Le résultat suivant a été établi dans [11]. 
Lemme 2.5 Pour chaque entier naturel m, nous avons 


Mn off TON igs (2.6) 


~ t(m)2 37 


Lemme 2.6 Soit R > 0. Nous avons, uniformément pour x > 3, 1 < k < Rlog, x 
et 0 € R*, 


[c(mz, 8)? ‘ 
som Forty < EM log(3 + De. (2.7) 
ne E(x) 


Démonstration. Appliquons 4 nouveau le Lemme 2.1. Le membre de gauche 
de (2.7) est 


y cos(? log p) 
P 


<K m(x) exp 
log(1/|3|) < log, p<L 


La somme en p peut étre estimée par sommation d’ Abel a |’ aide du théoréme des 


nombres premiers, comme au lemme III.4.13 de [14]. En estimant trivialement la 
contribution éventuelle des nombres premiers p tels que 


exp(1/||) < p < expflog?(3 + |8|)}, 


nous obtenons qu’elle n’excéde pas 2 log, (3 + ||) + O(1). Cela fournit bien (2.7). 
oO 


Lemme 2.7 Soient R > 0, B > 1/log2, 6 := Blog2 — 1. Pour tous x > 2, ¢ > 0, 
k EN, tels que 


Blog, x <k < Rlog, x, 
le nombre M;(x; €) des entiers n de &(x) tels que (nz) < ee! satisfait 
Mi (x36) & r(x) fe8?? + ert). (2.8) 


La constante implicite dépend au plus de R et B. 
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Démonstration. D’aprés (2.6), un entier n compté dans M,;(x; £) satisfait néces- 
salrement 


2 2 
sree! | len, OI ay > 1. 


2 T(nz)? 


Comme la contribution a I’ intégrale des nombres réels 0 tels que || < e“/(12ze) 
est trivialement < 5. il suit 


2 2 
ere! | LEG, OO ay > 1, (2.9) 
e+ /ey 


t (nz)? 


ou, pour simplifier l’écriture, nous avons posé €; := 127. 

Maintenant, nous observons que, quitte a écarter un ensemble d’entiers n de 
cardinal majoré par le membre de gauche de (2.8), le Lemme 2.4 nous permet de 
supposer que 


w(nz) — @s(nz)  1+6/2 
min a 
I/(ee4)<|o|<2  L + log |? | log 2 


et donc 


qotrr)—os (nL) (deh) 4/2, 
Cela implique 


[ ren ay = fl Ic(mz, 9)? __ dd 


Le, T(nz)? -L)e, 20M Fev ML) (Pel) IF52 
En tenant compte de (2.9), nous obtenons 


y It(nz, |)? — d(ve”) 


KQ(a) 
Qolny+ov(m1) (el) I+5/2 + Om (x)e) 


My (x; €) < &1 " 
e+ /ey n<x 


@(n)=k 


< my (x){e°/? + aust 


d’aprés le Lemme 2.6. oO 
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3 Entiers de 4;(x) sans petit facteur premier 


Désignons par P~ (n) (resp. P* (n)) le plus petit (resp. le plus grand) facteur premier 
d’un entier générique n > 1 avec la convention P~(1) = oo (resp. Pt (1) = 1). 
Nous posons alors 


m(x,y) := |{n € G(x) : P-(n) > y}|. 


Dans toute cette section, nous notons systématiquement x = y“ lorsque 2 < y < x. 

Le résultat suivant a été établi dans [9] (lemme 3.2) comme conséquence trés 
affaiblie de formules asymptotiques dues a Balazard [2] (lemme 5) et Alladi [1] 
(théoréme 7). 


Lemme 3.1 Soit R > 0. Il existe up = uo(R) > 1 tel que l’on ait uniformément 


pour u > uo(R), 1 < k < Rlogu, 


x (logu)*! 
logx (k—1)! 


(x,y) X< (3.1) 


Lemme 3.2 Soient cj, Co des constantes telles que 0 < cy, < Co. Il existe 
€1 = €1(c1,C2) > 0 tel que l’on ait, uniformément pour 1 < 7! < y < z < x", 
l<d< J/x etcjlogu<k<clogu, 


m1 (x/d, 2) X mex, y)/d. (3.2) 


Démonstration. Cela résulte immédiatement de (3.1). 


4 Preuve du Théoréme 1.1 


Commencons par établir la minoration. Nous retenons la notation k := k/ log, x 
introduite précédemment. 

Nous pouvons supposer y = yo ol yo est une constante arbitrairement choisie. En 
effet, dans le cas contraire, on a Ay(x,y) 2 Wk—w(m)(x/m,y) >> M(x, y) avec, par 
exemple, m = 1 + [y| €]y, 2y]. 

Soit € une constante absolue positive assez petite, dont nous préciserons la 
valeur ultérieurement. Nous posons L := log, y*, conservons la notation (2.3), et 
considérons |’ ensemble 


1 
Pt(m) < y®, |o(m) — KL| < VL log (=) ; 
€ 


t(m) < 3",A(m) > ee 


M:i= ams Jy: 
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Les Lemmes 2.2, 2.3 et 2.7 garantissent que, pour € convenablement choisi, 
x>xo(e), et K = K(e) assez grand, on any € St pour au moins 51k (x) entiers n de 
& (x). En décomposant chacun de ces entiers sous la forme mh avec P-(h) > y’, il 
suit 


Sx) < > me-oom(x/m, y). (4.1) 
mer 


Considérons alors les entiers n de &(x) qui peuvent s’écrire sous la forme mph 
avec m € IN, logy — logp € L(m), P-(h) > 2y. La seconde condition implique 
l’existence d’un diviseur d de m tel que 


logd— +4 < logy —logp < logd 


donc dp €]y, 2y]. Comme on a nécessairement 
logp > logy —logm = 5 logy > logy’, 


la décomposition n = mph, lorsqu elle existe, est unique. 
Posant Y(y,m) := {t € Rt : log(y/t) € Y(m)}, nous pouvons donc écrire 


A(x, y) = > 2 Tk—1—w(m) (x/pm, 2y). (4.2) 


meEM pEG(y,m) 


Notant 4 présent u := (logx)/ log 2y, nous avons 


k—@(m) =k—-KL+ 0-(VL1) =«klogut+ O-(VL1), 


d’aprés la définition de 9Jt. Lorsque y < x!/*, ot K = K(e) est une constante 
convenable, cela implique k — w(m) =< logu. Nous sommes donc en mesure 
d’appliquer le Lemme 3.2 avec d = p pour déduire que, pour tous les couples 
(m, p) de la sommation de (4.2), nous avons 


Tk—1—w(m) (x/pm, 2y) Ae Tk—w(m) (x/m, y*)/p. 


A ce stade, nous observons que Y(y, m) est, pour chaque m € St, une réunion 
d’au plus t(m) < (log y)* intervalles de longueur logarithmique totale A(m) et dont 
les bornes ont des logarithmes de I’ ordre de log y. Nous avons donc 


1 1 logp A(m) log y 
yo ->— Dd > + Ol 5) > 1. 


PEG (y.m) # logy pEG (ym) P log y 
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En reportant dans (4.2), il suit 


x 1 x 
Ak(X,y) De 2 Toi) (—.9*) 7a as 2 mi-vin)(—.)*) > T(x), 


mem pEG (ym) mem 
ou la derniére minoration résulte de (4.1). Etablissons maintenant la majoration 


de (1.1). A cette fin, nous considérons le nombre J;(x, y) des entiers n de &(x) 
tels que nz < y, avec a présent L := log, 2y. Nous avons évidemment 


T(x) — A(x, y) 2 Jax, y). 
Nous allons montrer que 
Ti (Xx, ¥) > K(x). (4.3) 


Chaque entier de la forme n = mh avec m < y, P~(h) > 2y et w(m) + a(h) =k 
est compté dans J; (x, y). On peut donc écrire 


I(x, y) = > THk—w(m)(x/m, 2y). 


may 
Retenons seulement dans la sommation les entiers m tels que 
|w(m) — KL| < Ly. 


Nous avons alors k — @(m) « logu = log, x — log, y sous l’hypothése y < x!/* 
dés que la constante K est convenablement choisie. Par (3.1) il suit 


x 3 (log uke)! 


Tag (k—w(m) —1)!m 


(4.4) 


may 
|o(m)—KL| SL, 


Une sommation d’ Abel permet de déduire des estimations connues de zx (x) (cf., par 
exemple, [14], chapitre II.6) que l’on a, pour | j — KL] < Ly, 


1 (log, y)/ 
) — Ae — . 
may J 
o(m)=j 


En reportant dans (4.4), il suit 


x (log, x)! k—-1 k-1-j,9j 
Ik(X,Y) > oer 1 2 cg aie 
J-KL|SL 
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avec 0 := (log, y)/ log, x, de sorte que kL = kv et, pour un choix convenable des 
constantes, L; > 2,/(1 — v)k. Un résultat classique sur les sommes partielles du 
binéme (voir par exemple le théoréme 6.1 de [4]) implique que la somme en j est 
> 1. Nous obtenons donc (4.3), ce qui achéve la démonstration du Théoréme 1.1. 


5 Preuve du Théoréme 1.2 


Nous supposons dans toute la suite yo < y < ./x, oll yo désigne une constante assez 
grande. 

Chaque entier n compté dans H;(x, y) admet une représentation sous la forme 
n = md avec d €ly, 2y]. Posons alors 


r = (m,d), m=morr, d=dorer, rir. |r%, 


a=o(m), b=o(d), jr=at+b<k, 


de sorte que w(r) = k —j. 
Il suit 


moro YY Yt 


ik r21,721,m21 y/rry<d<2y/rry moSx/dorir2r? 
atbSj rr [ree o(dy)=b @(mo)=a 
o(r)=k-j 


On montre facilement que la contribution correspondant a la condition supplé- 


mentaire r;r > x!/4 est < m(x)/logx : en effet, une majoration triviale des 
sommes en mp et do fournit une borne 


x ae x 
<« <« —_ap Ss —-= 
a : dX ryrgrs/? > rp(r) 
r2=1,r721,m21 rsx r=1 


ryry|ro° ryr2|roe 


x/4 crr<x 


Lorsque rir < x'/4 des estimations classiques (cf., par exemple, [14], chapitre II.6), 
impliquent que la double somme intérieure est 


x(log, x)! (log, y)?! 
rriro(a— 1)'(b — 1)! (log 3y/r) log x 
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Par sommation sur a et b, nous obtenons 


(log, x + log, yy? 
a __ (logy x + logy yy? 
ME) < A>, 2) FhaG— Dieses 


GSK r21n2ln21 
ryra|roe 


o(r)=k-j 
(log, x + log, yy~? 
KX —_—-—-eronoaoaaoo>Dooo 
2 dX b(r)?(j — 2)! log x(log 3y/r) 
o(r)=k-j 


& I, (X) (1 f 82)! = OG _ logu : 


logy log, x logy 2 log, x 


sous ’hypothése k < log, x. 
Tl vient 


(xu? ee _ Dae (hie 2 


A(x, y) « 1 = 
ogy logx 


ou nous avons conservé la notation k = k/ log, x. 
La majoration souhaitée (1.4) est donc acquise dés que 


7 log, x _ 1—1/log4 


KS 
log 2 log2 


log u — & (x). (5.1) 


Cela fournit la conclusion annoncée lorsque, disons, log y > e log x. 

Nous pouvons donc supposer dans la suite que u > e®. Choisissons L := log, y 
et conservons la notation (2.3). Nous pouvons supposer sans perte de généralité 
que la fonction € = &(x) apparaissant dans l’hypothése (1.3) satisfait également 
€ < JT). Soit kK) = (1 + €VL1/L)k. En vertu du Lemme 2.2, quitte a écarter un 
nombre acceptable d’entiers exceptionnels, nous pouvons supposer que 


(nz) < KL. (5.2) 
De méme, nous déduisons du Lemme 2.3 que l’on peut supposer 
ny <x, := 2xP(-87/2) (5.3) 


Les entiers restants comptés dans H;(x, y) admettent une représentation sous la 
forme n = dtm, avec les conditions 


y<d<2y, Pt()<y, t<m, P(m)>y. 
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Soit Hj (x, y) le nombre de tels entiers n n’excédant pas x et tels que w(n) = k. Nous 
avons 


* ~ ~ 7 ~ 
Hy (x,y) < » ) ) ) 
JNKL y<d<2y Sx mSx/dt 


Pt(t)<y  @(m)=k—-j 
w(dt)y=j  P-(m)>y 


La somme intérieure reléve du Lemme 2.1. Elle est 


x (log,x—log, y+ BAT! 
dtlogx (k-—j—1)! 


ou B est une constante absolue convenable. Posons alors d = rd, t = rt, avec 
(d\,t,) = 1. Nous avons w(d,) + w(t) <j, donc 


~ Ye<dah LGUs 


y<d<2y tS rea a+b<j y/r<diS ye ty Sx] 
P+ (1)<y w(d\)=a o(ti)=b 
o(dt)=j 


25035 _ Clog," oe > _Qlog, y)*_ 
2 —1)!p! ! 

r<2y , a+b<j (a 1)! b} (log 3y/7) roy c<j- c. cllog(3y/r) 
K It x (2 log, y)° K (2 log, yy 

1 ! i! 

O8Y eS c J: logy 


Nous avons utilisé ici les estimations classiques de z¢(v) pour estimer les sommes 
en d, et t;. Pour la derniére majoration, nous avons fait appel au résultat bien connu 
stipulant que, puisque j < k; log, y avec k; < 1/log2 < 2, la derniére somme en c 
est dominée par le terme d’indice c = j. 

Il suit finalement 


3 (2 log, y)/ (log, x — logy y + B)*77! 


Hy (x,y) < ————— 
(logx)logy <= ! (cajp= 1) 


~ 22 1082 Y (log, y)/ (log, x — logy y + BA! 
(log x) log y es J! (k-—j—1)! 
< ae (%) (logy) 81, 


On vérifie alors que (log y)*!!°22—! tend vers 0 dés que 


we (1 - ~~). 


k< 
log 2 Llog4 
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Comme 


logu 


Eilers = g(x, y), 


nous obtenons le résultat souhaité, quitte 4 altérer la fonction arbitraire &. 
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A Ternary Problem in Additive Prime Number 
Theory 


Jorg Briidern 


Abstract Estimates are obtained for the number of natural numbers below a 
parameter that do not have a representation as the sum of two squares of primes and 
a kth power of a prime. These improve earlier bounds in the order of magnitude. 
The method is then also applied to some related questions. 


Keywords Circle method * Sums of prime powers 


2010 Mathematics subject classification: Primary 11P32; Secondary 11P55 


1 Introduction 


This collection of research articles is dedicated to the memory of Wolfgang 
Schwarz. Early in his career he was a practitioner of the Hardy—Littlewood circle 
method [14-16], and he followed later developments of the method itself and its 
range of applicability with great interest. His survey article [17] gives ample proof 
of this, and the collection of open problems contained therein helped this writer 
into the subject [1]. It therefore seems fitting for the occasion to examine here the 
current status of a problem that Schwarz discussed in his thesis. Published in two 
parts [14, 15] and devoted in general to the additive theory of prime numbers, it 
is in the second of his papers where the intent is to obtain proof that almost all 
numbers expected to be representable in a certain proposed form indeed have such 
representations. 

Perhaps the most interesting of his results relates to representations of the natural 
number 7 in the form 


n= py +p3+P5 (1.1) 
J. Briidern (4) 
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in which k > 2 is a given natural number, and the variables p; range over primes. 
One may expect such a representation to exist whenever the congruences 


0 +32+x4 =nmodq (1.2) 


are soluble with (x;,¢g) = 1 (1 <j < 3), forall g € N, and we write % for the set of 
all n € N where this is so. It is well known and easy to prove! that % is the union 
of finitely many residue classes. In particular, # has positive density among the 
natural numbers. Schwarz [15, Satz 3] proved that for any fixed A > 0 the number 
E,(N) of integers n € # with 1 <n < N for which (1.1) has no solution in primes 
P1.P2,p3 satisfies E,(N) < N(logN)™“. Earlier Hua [7] had obtained this same 
conclusion for some positive A. After the fundamental innovations of Montgomery 
and Vaughan [13] in their treatment of the binary Goldbach problem, an estimate of 
the shape 


E,(N) « NI°H (1.3) 


should have been within reach for some positive 5(k), but a proof was only published 
in 1993 by Leung and Liu [9]. Their argument relied on the Deuring—Heilbronn 
phenomenon, as did the work of Montgomery and Vaughan, and hence little could 
be said about the dependence of 6 on k at the time. Five years later Liu and 
Zhan [11] noticed that for representation problems with more than two variables, 
potential Siegel zeros impact the major arc analysis to a lesser extent than is the case 
in a binary situation. This interesting observation has been instrumental in many 
applications since. In particular, dramatic progress was possible with the quadratic 
case of the Waring—Goldbach problem. We only mention the latest bound 


E (N) < N!7/20+e 


due to Harman and Kumchev [5]. For larger values of k, we have (1.3) with 


1 


1 
63)=—-e, d(kK)= 
B= zs 8H 
where ¢ is any positive number, as established by Lii [12] and Li [10, Theorem 1], 


respectively. These two papers are based on very similar ideas. An elementary bound 
for the number of solutions of the equation 


Pi tp, =P3+P4 


in primes p; not exceeding N '/2 is combined with an estimate of Weyl’s type for 
the degree k Weyl sum in a circle method analysis of the problem at hand. When 


‘For details see, for example, the footnote on p. 41 of [3], or inter alia in [15]. 
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k > 3, the best known Weyl! bounds for trigonometric sums over prime powers are 
due to Kumchev [8], and their direct use produces the estimates (1.4). For larger 
values of k, Li’s results can be improved by using Weyl bounds stemming from 
Vinogradov’s mean value theorem. Wooley’s groundbreaking work [19] on the latter 
can be combined with the strategy of Li to show that in (1.3) one may take 6(k) = 
c/k?, for some constant c > 0. 

In this paper we follow a different route and treat the kth power in mean. This 
increases the admissible values for 5(k) by a factor k when k is large, and when 
k = 3 we also obtain a sizeable improvement over the work of Lii [12]. 


Theorem 1.1 One has 
E;(N) « N'5/16+e and E,(N) « N!-1/ Qk? +e (k> 4). 


To access the strength of this result, it is perhaps worth pointing out that the 
circle method approach to problems of this type is currently limited by the square 
root cancellation barrier. For the problem under consideration, this corresponds to 
6(k) = 1/k in (1.3). If one is prepared to give up the primality of p3 in (1.1), then 
one can get reasonably close to this barrier. More precisely, let @ be the set of all 
natural numbers n for which the congruences (1.2) are soluble with (x,x2,q) = 1, 
for all g ¢ N. Let Ef (N) denote the number of all n € @ with 1 < n < N for which 
the equation 


pt+petxan (1.5) 


has no solution in primes p1, p2 and natural numbers x. 
Theorem 1.2 Let k > 4. Then EX (N) < N!-1/07K). 


No effort has been made to optimize the constant 97 that occurs in the exponent 
of the bound for Ef(N). Emphasis is on larger k, and consequently, we have not 
attempted to tune our approach so as to yield good bounds when k is of moderate 
size. 

It is interesting to compare the conclusion of Theorem 1.2 with related estimates 
for the prominent Hardy—Littlewood problem concerning sums of a prime and a 
kth power. When n € Y% we expect Eq. (1.5) to have about n'/*(logn)~? solutions, 
while whenever n — x* is irreducible as a polynomial over the rationals, the equation 
n = p+x* should have about n'/*(logn)~! solutions with p prime. Thus, one might 
be led to believe that the two problems are roughly of the same degree of difficulty. 
However, this does not seem to be the case, given our current understanding of 
exponential sums over primes. Indeed, if D,(N) denotes the number of all n < N 
for which n—x* is irreducible over Z[x] but n = p + x* has no solution with p prime, 
then the bound 


D,(N) « NIWV 256) (1.6) 
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was obtained in collaboration with Perelli [4] almost 20 years ago, yet assuming that 
no Dirichlet Z-function has a zero in the half-plane Re s > 5. Unconditional bounds 
for D,(N) come nowhere close to (1.6). The significant difference between the 
Hardy—Littlewood problem and the representation problem (1.1) is that the major 
arc estimates of Kumchev [8] have an amplifying effect whenever three summands 
are present, but only then. Kumchev’s bounds control Weyl sums related to p} + p3 
to a precision that is available for a single prime only if one is prepared to accept 
the use of unproven hypotheses. 

In his thesis work, Schwarz also considered additive problems with more than 
three prime powers. For example, when k > 2, representations of the form 


Pitp;+p3+py=n (1.7) 
or 
Pit+p3+PS+py=n (1.8) 


are considered. We focus our attention temporarily on Eq.(1.8), and on those 
integers n for which the congruences x7 + x3 + x$ + x4 =n mod q have a solution 
with (%1x%2x3x4,q) = 1, for all g € N. Let A, denote the set of all such n. 
Then Schwarz shows [15, Satz 6] that when n < N andn € ¥,; but n has no 
representation in the form (1.8), with all p; prime, then 7 is in a set with no more than 
O(N (log N)~“) members, again for any fixed A > 0. There is also a corresponding 
result for Eq. (1.7) [15, Satz 5]. Our methods apply to such problems as well, leading 
to analogues of Theorems 1.1 and 1.2 with little difficulty, the overall details being 
somewhat simpler thanks to the presence of a fourth summand. In fact, it is possible 
in the two problems (1.7) and (1.8) to establish an asymptotic formula for the 
number of solutions, with the exceptional set estimate still of strength comparable 
to the conclusion in Theorem 1.1. We content ourselves with a brief discussion of 
such a result for Eq. (1.8). In this context, define the counting function 


e(n)= => (logpi)(logp2) (log ps) log ps (1.9) 
pitpytp$tpy=n 
and the singular series 


co 86g 
sn=>> D> DS o@ ‘ela +3 +38 +24 -2)/9). (1.10) 
q=1 a=1_ I|sxy<q 
(aq)=1 1<j<4 
(,Q=1 
Theorem 1.3 Let k > 2 be a natural number, and let A > |. Then the inequality 


lon) — PPAF ADs(n)n"*| > n(logny 
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holds for no more than O(NI-U/BE) +e) of the natural numbers n not exceeding N. 
Further, there is anumber C > 1 such that forn € ¥, one has 


s(n) > (loglogn) ©. (1.11) 


In particular, all but O(N'~!/8")+¢) of the numbers n € -F with 1 < n < Nhave 
a representation in the form (1.8). It should be noted that -¥; is a union of finitely 
many arithmetic progressions (see [15, Satz 6]), and is therefore a set of positive 
density. 

Throughout this paper, we apply the convention that whenever the letter ¢ occurs 
in a statement, it is asserted that this statement holds for all positive real numbers e. 
Constants implicit in Landau’s and Vinogradov’s well-known symbols may depend 
on ¢€. Eulers totient is g(n), and w(n) is the number of distinct prime factors of n. 


2 A Mean Value Estimate 


We begin our deliberations by discussing a moment estimate for a certain Weyl sum 
that we now introduce. Let k > 3, and put 


ho(a,.X)= Y° e(ax*). (2.1) 
1<x<X 
Let M = X'/*, and let denote the set of primes p with p = —1 mod k and 
M < p < 2M. Then, fort € N, we define the sum h,(a@, X) by means of the recursion 
h,(a,X) = D> hy (ap*,X/p). (2.2) 
pEP 


By (2.1) and (2.2), it follows that one may also write 


hy(a,X) =D) wilx. X)e(ax') (2.3) 
x<X 
where wo (x, X) is the indicator function on {1,2,..., [X]}, and where for ¢ > 1, the 


coefficients w,(x, X) satisfy the relation 


w(x, X) = Ss Wr-1(x/p, X/p). (2.4) 


pix 
pez 
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The sum on the right-hand side of (2.4) has at most k — 1 summands. By induction 
on t, it follows that w,(x, X) = 0 for x > X, and that for x < X one has 


w(x, X) < kK’. (2.5) 


For t > 0 and s > 1, we shall bound the mean value 
1 
1.8.0) =f hole X)Piti (a 0) de (2.6) 
0 


It will be convenient to write 6 = 1 — 1/k. 


Lemma 2.1 Fort > 0 ands > 1, one has 
I, s, t) <K , Clee (2.7) 


This estimate should be compared with Lemma 2.1 of Briidern and Perelli [4] 
where a slightly superior estimate was obtained, yet with a Wey] sum that is not as 
flexible as our h,. Indeed, the Wey] sum used in [4] arises through the same recursion 
process as our h,, but one would start with the extra constraint in (2.1) that the x are 
restricted to primes. With the Riemann Hypothesis for Dirichlet Z-functions in hand, 
this makes little difference, but in unconditional work our new Weyl sum is much 
easier to use. This will become transparent in Sect. 4 below. 

We now commence the proof of the lemma. By (2.3) and (2.5), we have h,(a) « 
X, and consequently, by (2.6), one has the trivial bound /(X,s,t) < X5+2_ When 
t = 0, this already established (2.7), for all s. Further, when 1 < s < k, one has 
6° > 6k > 1/e > 1/3, and (2.7) is again evident. 

We may now suppose that s > k and t > 1, and then proceed by induction on 
s +t. By orthogonality, (2.6), (2.2) and (2.3), one finds that /(X,s,f) equals the 
number of solutions of the diophantine equation 


2s 
xt — xh = Do(-1)p i, (2.8) 
fal 


each solution counted with multiplicity 


2s 


] [-16;.X/p)). (2.9) 


j=l 


and the natural numbers x; and the primes p; subject to 1 < x; < X andp; € Y. 
It may be worth remarking that we have p;y; < X for all 1 < j < 2s whenever the 
multiplicity (2.9) is non-zero. 

Let J) denote the number of solutions of (2.8), counted with weight (2.9), where 
for all 1 < j < 2s one has p; | x; and p; | x2. For / = 1 or 2, let J; denote the 
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number of solutions of (2.8) where p; + x;, counted with weight (2.9). Then, since 
for any solution counted by /(X, s, t) but not by Jo, there is some j such that p; does 
not divide one of x,, x2, we deduce by symmetry that 


1(X,5,t) < Ip + 2s(; +b). (2.10) 


We require an upper bound for Jp. For p = (pi,-.-.,P2s) € F*, let 


G(a, p) = y e(ax*), (2.11) 
1<x<X 
pjlx 1SjS2s) 


Then, by orthogonality and the triangle inequality, and then applying the elementary 
bound [zz ... Zos| & |zi[?* +... + |zos|7°, we infer that 


1 
of 160. p)Pina(aph. X/pr).-. na ap. X/p2)| da 
0 
pe Ys 


1 
<  f 16@.pyPih-vept.x/p)P* aa. (2.12) 
pe Ps o 


Note that at most kK—1 different primes pj € Y can divide a number x with x < X. 
Hence, for the sum in (2.11) to be non-empty, at most k — 1 of the entries of p can 
be distinct, and we may restrict the sum in (2.12) to such p, of which there are no 
more than O(M*~') choices. For a fixed such choice, we deduce from orthogonality 
that the integral in (2.12) does not exceed the number of solutions of the diophantine 
equation 


M4 = PIO Yet... +51 — Vhs) (2.13) 


with the variables x;, x. constrained to 1 < x, < X and p, | x (J = 1,2), each 
solution counted with weight 


2s 
| [-16;.X/pv. (2.14) 


i=l 


We put x; = pz in (2.13) and use orthogonality again to conclude from (2.6) that 


1 
[ 160. p)P liar. X/py)P dee = 10¢/p1.5.1 = 1). Q.15) 
0 
But p; € # so that p; > M, and then the induction hypothesis supplies the bound 


1(X/p1,8,t—1) K (X [Myst 2-438") 
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By (2.12) and (2.15), we then deduce that 
pe Mi! (X/My2t 2+ RO +364) < Xx! 
where 
w= 0+ O(2s+2—k+k(O! + 30°) 
<2s+2-k+k(6'+30°)+6- 70s +2-8). 

Hence, since s > k, we see that 

ape XAT IAAT ROT 36) (2.16) 


as required. 

We now proceed by considering cases. First suppose that J; + Jy < Ip. Then, 
by (2.10), we conclude that (X, s,t) < Jo, and (2.7) follows from (2.16). 

This leaves the case where Jp < [; + In. Write 


H,(a@) = a e(ax*). 
1<x<X 


ptx 
Then, for / = 1 and 2, we infer by orthogonality that 
I 
i= > H,(oe)hy—1 ((-1)'* "pk, X/p)ho(—a)hy ((— 1) ex) | (oe) |? dee, 
pEeP 7 


where now we write h,(a) = h,(a,X) in the interest of brevity. By Hélder’s 
inequality and (2.6), one finds that 


ah Meee 
he > IPRe M12, (2.17) 
pe? 

where 

: k 2, 

Jy = fee) Ura phe X/) 

0 

and 


1 
re i] \H,(ct)|2|hy(cr) 25 de. 
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Note that the right-hand side of (2.17) is independent of /. Furthermore, by 
orthogonality and a consideration of the underlying diophantine equations, one has 
K, < 1(X,s,t). In the case under consideration, (2.10) yields /(X,s,t) «K I + h, 
and (2.17) delivers 


1%, 3,97 < yo TO. (2.18) 
pEP 


Fix a prime p; € #. Then, by orthogonality one finds that J,, is equal to the 
number of solutions of Eq. (2.13) with 1 < x, < X,1 < x. < X,p, + x;xo, each 
solution counted with multiplicity (2.14). By (2.13), we have < = < mod ae and 
since py € &, it follows that pk | x; — x2. However, |x; — x2| < X = Mk < pi 
so that we must have x1} = x2. Hence, by (2.5), (2.6) and orthogonality again, we 
conclude that J,, < XI(X/pi,s—1,t— 1). Since X/p; < X°®, we first deduce from 
the induction hypothesis and (2.7) that 


Jy < X1F6 Ask +k(G'+36") 
P| , 


and then infer from (2.18) the final bound 


ee s, t) < MX! 46 2s-H+KG'+30%) = SESE Se) 


completing the induction. 


3 The Basic Argument 


In this section, we shall present the basic argument that underpins all proofs. Along 
the way, we shall establish Theorem 1.1. Let k => 3, and consider the Weyl sums 


f(a)= Yo elap*)logp, g@)= > e(ap‘)logp. (3.1) 


4N<p?<N 4N<pk<N 


Whenever # C [0, 1] is measurable, we put 
ran) = | fla)*ela)e(—an) da. 6.2) 
B 
By orthogonality, 


ro@.N)= >> (logp:)(logp2) log ps. 
pitpst+ph=n 
$N<pi.p3.pS<N 
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For N <n < 2N,n € %, one expects rjo,1)(n, NV) to be of size N'/« and we proceed 
to establish this for almost all n. Let 


O=Nv), (3.3) 
and write Jt for the union of the intervals 
fa € [0, 1] : |ga — al < ON! (logN) 7%} 


with 0 <a <q, (a,q) = 1 and1 <q < Q. We also put m = [0, 1] \ 9. 
The evaluation of roy (n, N) we are fortunate to be able to borrow from the work 
of Li [10]. Let 


q q 


Sn0= s+" 5 (gq) °e(aq +25 +44 —n)/q), 
q<Q a=1 X14X2,.%3=1 
(a.q)=1 (x1x2x3,q)=1 


JON = DY mm)? 9". (3.4) 
my +m2+m3=n 
4N<m, m2,3<N 


Then Proposition 2.1 of Li [10] asserts that whenever N < n < 2N, one has 
1 
ron(n,N) = FG (n, Q)I(n,N) + OW! (log N)~™), 


and for these n, the lower bound J(n, N) >> N '/k is immediate. Further, Lemma 3.1 
of Li [10] yields G(n, Q) >> (log N)~!** for all but O(N!~!/85+*) of the integers 
n€ AG with N <n < 2N. This establishes the following result: 


Lemma 3.1 For all but O(N'!~'/©) of the integers n € H4, with N <n < 2N, one 
has roy (n, N) > N"/‘(log N)7!*. 


Our treatment of the minor arcs depends on an important estimate of Kumchev 
[8] that we now describe in a language suitable for application within this paper. Let 
1<Y<N"/ 8 and let NY) denote the union of the pairwise disjoint intervals 


Ny al(Y) = {a € [0,1]: |gqa—a| < Y/N} 


with 0 < a < q, (a4,q) = land 1 < q < Y. We write N% = N(N1/8) and n = 
[0, 1] \ Dt. We define the function Y : [0, 1] — [0, 1] by putting Y(a@) = 0 fora En, 


? An oversight in [10] is corrected here. The variables pr. p>. pi run over (4N, N] in [10], but then 


3(n,N) = Oforn < 2N which is not acceptable. If these variables run over ( iN , N] instead, as 


we have arranged matters here, then the proof of Proposition 2.1 in Li [10] becomes valid. 
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and when a € It Ny.a(N'/8) by writing 
Y(a) = (G+ Ngo —al)™. 
Lemma 3.2 Uniformly for a € [0, 1], one has 
f(a) |? « NU/8+e 4 Nt ETC). 


Proof Theorem 3 of Kumchev [8] provides an estimate slightly stronger than that 
claimed in Lemma 3.2, but for the sum 


Thus, Lemma 3.2 follows by partial summation. 


Before embarking on the estimation of the minor arc integral, we collect a 
number of mean value estimates. The first of these is the inequality 


1 
/ If(a)|* da <« NIT (3.5) 
0 


that follows from a consideration of the underlying diophantine equation and Hua’s 
lemma [18, Lemma 2.5]. Further, when k = 3, we put u = 4, and when k > 4, we 
put u = k?. Then, the upper bound 


1 
/ |g(a)|> da < NH/H-l+8 (3.6) 
0 


is again a consequence of Hua’s lemma in the special case k = 3, and for larger k this 
bound follows from Wooley’s estimates for Vinogradov’s mean value (for example, 
(20, Corollary 1.2], where an even stronger bound is obtained) and a consideration 
of the underlying diophantine equation. 

We initiate the minor arc analysis by applying Bessel’s inequality to (3.2). Thus 


yo rm(n,NY < i IF(a)[*|@(@) 2 da. 3.7) 


N<n<2N 


By Holder’s inequality, 


[ V(a)|"le(apl? de < sup canit( f “(flay dar) "( / “|e(a) 2" dar) 
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Hence, by (3.5), (3.6) and Lemma 3.2, 
/ f(a) |*|2(@)|? da < N/m nll /u py 2/k)—1/u <K eats (3.8) 
n 


where 6 = 1/(4u). 

This leaves the set St m for treatment, and this set is covered by the union of sets 
AR(Y) = N(2Y) \ NY) as Y runs over 2-1/8, with O(logN) < Y < NV, 
Note that Y(@) < Y~! fora ¢ N(Y). Further, Lemma 2 of Briidern [2] supplies the 
bound 


/ Y (a) | g(a) |? da <« YNVETET! 4 yelk tend, 
M(2Y) 


This implies that 


/ Y (a)? |g(a)|? da <K Ni/kte liy lyy2/k+e 1 
RY) 


For a € 9, we deduce from Lemma 3.2 that |f(a)|>_ «< N'**Y(q). Hence, on 
summing over Y, we infer that 


/ If(a)|*| g(a) |? da « NIT AO+ | (3.9) 
mynm 


Note that this bound is superior to the one in (3.8), so that we now deduce from (3.7) 
that 


> rm(n, NY? <K N!+2/k-S+ 8. 
N<n<2N 


Consequently, the inequality |rm(n,N)| > N'/‘(log N)~>* can hold for no more 
than O(N!~*+*) of the integers n € (N,2N]. The conclusion of Theorem 1.1 now 
follows by combining this last observation with Lemma 3.1 and a dyadic splitting 
up argument. 


4 A Variant of the Main Theme 


We now tune the basic argument to deliver Theorem 1.2. Fix k > 4, and recall that 
6 = 1—1/k. Then choose t € N such that 97"! > + > 6°. With this choice of 1, 
we put h(a) = h,(a, N'/*). With f(a) as in (3.1), we define 


H(n,N) = [ fee?Hae(-am és, 
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where # C [0, 1] denotes a measurable set. By orthogonality, (3.1) and (2.3), one 
has 


Toa) (2, N) = > w(x; N'/") (log p1) log po. (4.1) 
pitpitc=n 
4N<pi p3<N 


Much as in the previous section, we expect that for most n € (NV, 2N] NG the count 
10.1] (n, N) comes close to n!/*, and this can be established by invoking Lemma 2.1 in 
place of (3.6) within the minor arc work performed in the previous section. However, 
the use of the sums /(a) causes extra complication in the major arc analysis. 

We overwrite previous usage by now putting 


Q=NeK 
Let 9 and m be defined as in Sect. 3, but with this new value of Q. The arcs Jt, It(X) 


and n retain their meaning from Sect.3. We now mimic the argument departing 
from (3.7). Put s = 4k and then apply Hélder’s inequality to see that 


[ veeottetay? ae = sup reoi*( J ireyt*ae)' "(fren P* a)" 


By Lemma 2.1, (3.5) and Lemma 3.2, we discern that 


[vtco*nay*| aa K NUAIFENIAUs (yCS/D-1FOF30V 1/5. ee YI Q/D tate, 
n 


(4.2) 
where 


is ~(6' + 36-2). 


Recall the definition of t, and observe that 0“ < e~! and e* > 54. It is now readily 
confirmed that 


a< (36° - 5) < — (4.3) 


Next, we consider the range 9t N m. Mimicry of the deduction of (3.9) yields 


/ If(a)|*|h(a)|? da < yore. 
mam 
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Now, since k > 4and6'> 46> 


= 36 x , we infer from (4.2) and (4.3) that 


I F(a) |*|A(@)|2 dar <« NIE ar 


By using Bessel’s inequality as in (3.7), this estimate allows us to conclude as 
follows: 


Lemma 4.1 For all but O(N'~'/@%) of the integers n € (N,2N] one has 
Ir* (n, N)| < N'/E (log N)~ 1%, 


For the major arcs, we have to prepare for an application of the methods of Liu 
and Zhan [11] with quite some effort. We begin with an explicit formula for the 
sums h(a). Put P = N!/«, With t still as above, let Y (N) denote the set of all tuples 
p = (/pi,...,p;) built from primes pj = —1 modk (1 < j < 2) that satisfy the 
inequalities 


PME <p, < 2PYE, (P/(p1...p))"* < pyr 5 2(P/(pr..-p))"* A Si<9. 
(4.4) 
For p € Y(N) we write u = u(p) = pip2...p;. Then, repeated use of (2.2) yields 


h@)= > >> e(a(ux)'). (4.5) 
peY (N) x<P/u 
We summarize some estimates concerning Y (NV) in the next lemma. 


Lemma 4.2 For all p € Y(N) one has u(p) = P!~. Further, 
#Y (N) x P!~* (log N)~*. 


Proof An inspection of (4.4) shows that there exist constants 0 < c’{ 7 =< ti < 
c2j < ¢), with the property that whenever c je < pj <e spe holds for all 
1 <j <1, then p € Y(N), and whenever p € Y (N), then c, Pie <pj < Ce le 
for all j. The conclusions of Lemma 4.2 now follow from Chebyehey! s estimates. 


We are ready to develop a major arc approximation for the sum h(@). When 
aéZ,qéeNand 6 € R, we put 


q 


Y 
S(q.a) = Yeax'/q). ViB.¥) = f elBy"yay. 
x=1 0 
Then, according to Theorem 4.1 of Vaughan [18], one has 


> (2 + B) (ux)*) = q'S(q,au")V(Bu‘, P/u) + O(g°(q + NalB\)""”). 


x<P/u 
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A change of variable shows that V(Bu«, P/uw) = u~'V(, P). Hence, on taking a = 
(a/q) + B in (4.5), we deduce from Lemma 4.2 that 


S(qg, au* - 
na) =v(6.P) So 2 4 oP!" ¢(q + Nalpl)")) 
pe (N) 


where again we wrote u = u(p) in the interest of brevity. Next, consider the sum 


v(B) = 7 mel), 


m<N 


Maclaurin’s summation formula yields V(6, P) = v(B) + OU + P|B|). Hence, if 
we define the function h*(q@) fora = (a/q) + B € WM with g < Q, (a,q) = 1 and 
|B| < Q/N by 


k 
h(a) = v(B) >> aa, (4.6) 


pew (N) 
we find that uniformly for w € Jt one has 
h(a) = h*(a) + O(P!-2°'+*), 


Let 
Han) = [ h* (a)f (a)? e(—an) da. 
om 


Then, via Bessel’s inequality, we deduce from the previous estimate that 


Sina) =H O.m)? = f [h(a — He yflay*Pda < PHN, 
N«<n<2N am 


and we may conclude as follows: 


Lemma 4.3 For all but O(N'~'/) of the integers n € (N,2N] one has 
[74 (n, N) — r'(n, N)| < N'/* (log N)~ 10%, 


We now evaluate 7’(n,N) further, replacing the exponential sum f(a) by its 
natural approximation. This is straightforward by the method proposed in [11]. In 
this process we receive help from the function h*(a), as this decays in q. To see 
this, note that whenever (a,q) = 1, one finds from Theorem 4.2 of Vaughan [18] 
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that q-!S(q,au") « q-'/*(q,u')'/*. Consequently, by (4.6), Lemma 4.2 and 
[18, Lemma 2.8], 


JU _ _ 
h* (a) « P(q + P*|qa —al)~'/* > a) & P(logN)“‘(q + P*|qa — al)". 


pe (N) 


In such a situation, the method of Liu and Zhan [11] is particularly easy to apply, 
and on following the recent exposition of Hoffman and Yu [6], for example, one 
identifies a leading term, featuring the singular integral (3.4) and a kind of singular 
series that we now introduce. Let 


q 


S*(q.a)= Yo e(ax’/q). 


x=1 
(x.q)=1 
and then put 
q 
Au(g.n) = 9(9) °¢'| Yo S*(q,a)°S(q, auye(—an/q) (4.7) 
po 
to form the sum 
OM,N)= D> up)! D> Au (a.n)- (4.8) 
pew (N) q<Q 


Equipped with this notation, we summarize the outcome of the Liu—Zhan method in 
the next lemma, but as we pointed out already, there is no need to present a detailed 
proof because the reader will have no difficulty in providing one along the lines of 
Hoffmann and Yu [6, Sect. 6]. 


Lemma 4.4 Let A > 1. Then, for all N <n < 2N, one has 
r'(n,N) = O(n, N)3(n, N) + O(N'/* (log N)). 


Our next task is to disentangle the sum over p in (4.8), and realize the partial 
singular series 


S*(n,g) = ))Ai(g.n) (4.9) 


qsQ 


as a factor. With this end in view, we examine A,,(q, n) more closely. The number of 
incongruent solutions x1, x2,.x3 of the congruence x} + x3 + (ux3)* = n mod q with 
(x1X2,q) = 1 is a multiplicative function of g, and by orthogonality, this function is 
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given by 


q 
q | >) S*(q,a)S(q, au‘)e(—an/q). 


a=1 


Hence, by Mobius’s inversion formula, we read off from (4.7) that A,,(q, 1) is also 
multiplicative as a function of g. Thus, we may restrict attention to the case where 
q = p' is a power of a prime. In this instance, we apply a result of Hua [7], showing 
that S*(p',a) = 0 holds whenever p + a and / > 2 when p is odd, and also for 
p = 2,1 > 4. Hence, by (4.7), for all uw € N one has 


A,(p',.n)=0 (> 2,p> 3, orl >4,p =2). (4.10) 
Now suppose that p is an odd prime. When p | u, we find from (4.7) that 


p-l 


Au(p.n) = Ap(p.n) = (p— 1) Y | S*(p,a)"e(—an/p). 


a=1 
We write 
P ae 
i= = e(x’/p), ¢(n) = Y  e(—an/p). 


x=1 a=1 


Then, by familiar properties of the quadratic Gaus sum, whenever p + a we have 


P 
* oz 2, — _ e = 
S"(p, a) = ae /p)-1= (5) 1. 
Hence 
* 2 2D = a 
S*(p.ay = +1 (=). (4.11) 
and 


p-l 


Ap(p.n) = (p— 1) 7 + ep(n) — 2(p — I) tp D | (Fe 7 >): 


a=1 


The remaining sum on the right transforms into (n | p),T,, so that 


Ap(p.n) = (p—1)7(t) + Mep(n) — 2(p- lp P(5). 
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From the classical identity |t,|? = p we now conclude that 
|Ap(p.n)| < 8p~'(p.n). (4.12) 


Next consider odd primes p with p + u. Then, a simple transformation shows that 
S(p, au) = S(p,a), and hence that A,,(p,) = A(p,n). When p + a one also has 
(Vaughan [18, Lemma 4.3]) 


S(p.a) = >> x(t) 
Xx 


where y runs over the (k, p — 1) — | non-principal characters y, mod p, for which 
x‘ is principal, and where 


Pp 
u(x) = >- x(ae(a/p). 


a=1 
By (4.11), 


p-l 


A(p.n) = (p= DP" YY Hat + 1 - 2(7 eel =) 


a=1 x P 


p-l 
= (pI P' cG + VY x@e(- ) 
x a=1 


p-l 


Ce ge BlL eS na (=e( - =). 
Xx a=1 


When p | n, the first summand on the right vanishes, and so does the second 
unless y = yx is the Legendre symbol. From the standard upper bound for Gaui 
sums, it now follows that |A;(p,7)| < 2/(p — 1) whenever p | n. If p + n, then the 
sum over a in the first summand on the right-hand side of the preceding display again 
transforms into a GauB sum, and so does the sum over a in the second summand 
unless again x is the Legendre symbol, in which case the relevant sum becomes the 
Ramanujan sum c,(n). A short calculation now shows that whenever p + n one has 


4k 6k 
Ate)|=——Ss —, (4.13) 
p-l” p 
and we then see that this bound holds for all odd primes p. 
Based on (4.12) and (4.13), we may now evaluate @(n, N) in terms of the singular 
series. 
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Lemma 4.5 We have 


O(n, N) = G*(n, Q) > u(p)~! + O(N~8'/*), 
peZ (N) 


Proof By (4.8) and (4.9), 


1 1 
OM,N)-6*0,0) D> -= DD => 5 (Augn)-Aign)). (4.14) 
pe% (N) 7 pe& (N) e qsQ 


By (4.10), we may restrict the sum over gq to the set 
2={q<0:1644q,p + ¢ forall odd p}. 


Further, for each pair p,g with p € Y(N), q € @ and (u(p),g) = 1, one has 
A,(q,n) = A,(q,n), so that these pairs do not contribute to (4.14). Hence the sum 
in (4.14) may be restricted further, to those pairs p, g where (pip2...p;,q) > 1. 


Let J denote a non-empty subset of {1,2,..., y}, and let 
HI) = {((p.qg) € VN) x 2: p; | qforie I, p; t qforj ZT}. 


The argument from the preceding paragraph shows that any pair p, g that makes a 
non-zero contribution to the sum on the right-hand side of (4.14) is in some -# (J). 
Now suppose that (p,q) € -# (/). Then, at least when N is large, the p1,...,p; are 
all distinct, and hence, the number 


u= | [pi 


ie! 
is a divisor of g with (q¢/u', u(p)) = 1. It follows that 
A,(q.n) = Ayu’, njA,(q/u', n) = Ay (u’, nA, (q/u', n), 


and we infer that 


Au(q.n) — Ay(q,n) = (Aw (u',n) — Ai(u',n))A\(g/u',n). 


Consequently, the contribution from (p,q) € -# (/) to the sum in (4.14) is no larger 
than 


1 
DE A/a!) (Aw(u'.n) — Aw’): 
pew (ny & qe 2 

q=0 mod u’ 
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We now write v = g/u' and deduce from (4.12) and (4.13) that the expression in 
the preceding display is bounded by 


1 (6K)2\ 82H) (u’,n) 
« © (Se) « og y* SY —. 
pew (N) v<O pe (N) 


Here the sum over p factorizes. For i € J the corresponding factor does not exceed 


Pe & yO, (4.15) 
pixNa tk i 
while for j ¢ J the sum over p; is certainly bounded. Since J is non-empty, we have 
at least one factor (4.15), producing the estimate O(N aay *) for the portion of (4.14) 
where (p, gq) € -# (/). Summation over J completes the proof of Lemma 4.5. 

We now need a lower bound for the singular series. Although there is no explicit 
reference at hand for the sum G*(n,Q), there is no difficulty in adjusting the 
arguments of [9, Sect. 6] to the present needs, and we obtain the following result, 
analogous to Lemma 3.1 of Li [10]. 


Lemma 4.6 For all but O(NO*~'/) of the integers n € (N,2N] NG, one has 
6*(n, Q) > (logn)~**, 


We are ready to establish Theorem 1.2. Indeed, from Lemmas 4.2, 4.5 and 4.6, 
we find that O(n, N) >> (log N)~!°* holds for all but O(N~!/0™) of the integers 
n é& (N,2N| NG. By Lemma 4.4 and (3.4), we deduce that for these n we also have 
r'(n,N) > N'/ (log N)~**, The definition of t implies that t < 5k, and hence, by 
Lemmas 4.1 and 4.3, we finally see that the lower bound 


rio). N) > N'/“(log N)~* 


holds for all but O(N~!/0”) of the integers n € (N,2N| NG. By (4.1) and a dyadic 
dissection argument, this confirms the conclusions recorded in Theorem 1.2. 


5 The Quaternary Problem 


We end with a short sketch of a proof of Theorem 1.3. Since we attempt to establish 
an asymptotic formula, we can no longer work with localized Weyl sums but have 
to use their brethren 


Fi(a) = Y 7) e(ap') logp. 


pl<N 
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Then, by (1.9) and orthogonality, whenever n < N one has 


1 
i= i Fy(0) F(a) Fe(0)Fe(a)e(—on) da. 


We define major and minor arcs {Jt and m as in Sect. 3, with Q defined by (3.3). 
When @ is a measurable subset of [0, 1], we put 


ow(n,N) = i F(a) Fs(a)Fs(a)F;(a)e(—an) de, 


The major arc work will not detain us for long, as this is standard for the 
experienced worker in the area. First, one begins by applying the now standard 
methods of Liu and Zhan [11], and this leads to the preliminary asymptotic relation 


om(n,N) = PS) GP (Z)s(n, O)n'* + O(N'/* (log N)™) 


that is valid for any fixed A > 1| and all n < N, and in which the partial singular 
series is given via 


s(n, Q) = )° B(q,n) 


qsQ 


and 


q 
Ban= >) >> o@ ‘eG t+4+%8 +x4-n)/9). 


a=1_  |S<xj<q 


(j.g=l 


The theory of the arithmetic function B(g,n) is similar to that of A(q,n) in 
the previous section. In particular, B(g,n) is again multiplicative in g, and by 
an argument paralleling that leading to (4.10), (4.12) and (4.13), one finds that 
B(p,n) « p~7/?(p,n)'/? while B(p',n) = 0 holds for all p, all / > 2. It follows 
easily that the sum (1.10) converges absolutely, and that s(n) —s(n, Q) « n°Q?-"/?, 
In particular, we see that 


om(n,N) = (5) (4) (Z)s@)n'* + OW'/F (log N)). (5.1) 


To obtain a lower bound for s(n), we use multiplicativity to write the series as an 
Euler product which takes the shape 


s(n) =] + B(p.n)). 


Pp 
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By orthogonality, one deduces that 1+ B(p,n) = p(p—1)~*M(p, n) where M(p,n) 
is the number of incongruent solutions of the congruence y;+y3+ yS+ yk =nmodp 
with p + y:yay3y4. The congruence condition in Theorem 1.3 assures that M(p,n) > 
1, and so the Euler factors of s(n) are positive, and bounded below uniformly in n 
by p(p — 1)~*. Further, for p + n, we noted earlier that B(p,n) < p~*/?. Hence, we 
conclude that 


s(n) > | Jd + B(p.n)). 


pin 


For p | n, we mentioned already that there is a number c > 0 with |B(p, n)| < cp7!. 


We apply this bound for p | n with p > 2c, and use the uniform lower bound for 
1 + B(p,n) for the smaller primes to confirm (1.11). 

This leaves the minor arcs m. The main difficulty here is that Kumchev’s 
exponential sum estimates refer to localized sums. We have to turn these into bounds 
for F(a), and this requires some care. 


Lemma 5.1 Uniformly for a € [0, 1], one has 
F(a)’ < N7/8+¢ + N'**Y(q@), F3(a)? < N13/21 +e + N7/+eY (a), 


Proof Let P = N'/? anda € [0, 1]. For 1 < R < P, put 


f(a,R)= > e(ap’)logp. 


R<p<2R 


A dyadic dissection argument provides some R = R(a) with P’/8 < R < 3P and 
the property that 


Fra) = SY) e(ap*)logp + O(P™*) « P’® + F(a, R)| logN. (5.2) 
P18 <p<P 


Fix a (small) 6 > 0. If [f(@,R)| < P’/8+8, then Fx(a) « P7/8+®. In the opposite 
case, we apply Dirichlet’s theorem on diophantine approximation to find a € Z, 


q € N with 1 < gq < R*” and |qa — a| < R~*/? before using Theorem 3 of 
Kumchev [8] in conjunction with partial summation. This yields 


PUBS < |F(a, R)| K RY/8** + RIF8(g + R?\qa — al). (5.3) 


It follows that g + R?|qa — al «< R?+*P~7/+-*. For large N, this implies that 
q < P'/4 and |gqa — a| < P~7/*, and hence that a € Nt. The trivial bound 


R°(q + P’|qa —al) < P?(q + R’|qa — al) 
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now suffices to conclude from (5.3) that |f(a, R)|? << N'*®Y(q), and (5.2) then 
delivers |F2(a)|? < N'*®Y (a). We have now shown that whenever 6 > 0, one has 


|Fo(a)|? « N7/8+5 4 NIFeY(q@), 


This establishes Lemma 5.1 for F2, and the bound for F3 follows by the same 
argument. 

We are ready to run the basic argument from Sect.3 for the minor arcs. One 
observes that 


1 
i |F2(a)F3(a)F6(a)|* da « N'T* (5.4) 
0 
(see Schwarz [15], Lemma 1.2 and Korollar), and as in (3.6) we also have 
: 2 
i |F(a)|* da <« Nhe (5.5) 
0 
By Holder’s inequality, we find that the integral 
[ \escorero(ayFitay? da 
n 
does not exceed 
1 1-1/2, pl 1/k? 
sup |Fa(a)Fa(a)Fo(a)™( f |FaFsFoP'da) (|r de) 
aen 0 0 


We now use (5.4), (5.5) and Lemma 5.1 for F, together with trivial bounds for F3F'¢ 
to conclude that 


/ |F2(a)F3(a)Fo(a) F(a) [2 da « N'tE Hae te, (5.6) 


For the set 9t N m the argument from Sect. 3 requires an upgrade. First we apply 
Lemma 2 of Briidern [2] to confirm that 


[ reir? aa <n, 
MN 
and we then see via Lemma 5.1 that 


i |F2(a)F6(a)|? da « NI/3+8, 
NM 
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For a € St \ Mt(N'/?!), we find from Lemma 5.1 that |F3(a)|?7 « N!3/2!+*, The 
trivial bound for F; is now enough to confirm the estimate 


20 2 
i aoe |Fo(a)F3() Fo (a) F(a) |? da << N2tEFE (5.7) 
mM\sN!/2 


For a € 3t(N!/?!) Lemma 5.1 produces |F2(a)F3(a)| « N°/3+*Y (a), and then 
the argument leading to (3.9) delivers the bound 


/ lFr(o) F(a) Fo(a) Feo) |? doe << Nite, 
s(N1/21)0m 
This estimate combines with (5.6), (5.7) and Bessel’s inequality to 


2 2 1+¢-4+6¢ 
Diem(n.N)Y < J |Fo(a)F3(a)Fo(a)Fx(a)|? da << NITE, 
m 


n<N 


It follows that for all but O(V!~!/(8k")+2¢) of the integers n < N one has 0m(n,N) < 
N'/‘-© This combines with (5.1) to confirm the conclusions in Theorem 1.3. 
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An Improvement of Liouville’s Inequality 


Yann Bugeaud 


To the memory of Wolfgang Schwarz 


Abstract The Liouville inequality gives a lower bound for the distance between 
two distinct algebraic numbers in terms of their heights and their degrees. We refine 
the classical estimate in the special case where one of the algebraic numbers is very 
close to one of its Galois conjugates. 


Keywords Algebraic numbers * Height * Liouville’s inequality 


2010 Mathematics subject classification: Primary 11J04; Secondary 11J68 


1 Introduction and Results 


Throughout this note, the height H(a) of an algebraic number a (in the complex 
field C or in a p-adic field Q,) is the maximum of the absolute values of 
the coefficients of its minimal defining polynomial over Z (that is, the integer 
polynomial of lowest positive degree, with coprime coefficients and positive leading 
coefficient, which vanishes at a). The Liouville inequality gives a lower bound 
for the distance between two distinct algebraic numbers in terms of their heights 
and their degrees. We state below a version of it, which is a consequence of 
Theorems A.1 and A.3 of [1]. 


Theorem 1.1 Let a and B be distinct algebraic numbers of degree n and m, 
respectively. Then, 


la = Bl > gi-a (n Ae i a (m + ij H(a)-” H(p)~". 
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Recently, it was observed in [2] that, if @ and 6 are quadratic numbers, then 
Theorem 1.1 can be improved when a is very close to its Galois conjugate. A precise 
statement is Lemma 7.1 of [2], which we reproduce below. 


Lemma 1.2 Let o and B be real quadratic numbers. Denote by a’ and B' the Galois 
conjugates of « and B, respectively. Assume that a, a’, B, B' are distinct. Then, we 
have 


la — B| > 0.03- max{|a — a’|~!, 1}- H(a)? - H(B)°. 


The quantity max{|w — a’|~!,1} can be as large as H(a)//5 (take for a a 
root of the polynomial (m? + m — 1)X? — (2m + 1)X + 1, where m > 2 is an 
integer), making then a substantial improvement on Theorem 1.1. Lemma 1.2 is a 
key ingredient in the proof that certain transcendental continued fractions cannot be 
very well approximated by quadratic numbers. 

The purpose of the present note is to extend Lemma 1.2 to algebraic numbers of 
arbitrary degree. 


Theorem 1.3 Let a and B be distinct algebraic numbers of degree n > 2 and m, 
respectively. Then, for any Galois conjugate a’ of a, distinct from a and satisfying 
la —a’| < 1, we have 


lof] = 21" 1M? (m1)? ming Hee)”, Jar—ae'|""} Ha)” HB). 


Theorem 1.3 holds if «’ = £, but it yields a weaker estimate than Theorem A.3 of 
[1], which states that the distance between two distinct conjugate algebraic numbers 
a and a’ of degree n > 2 satisfies 


la—a'| > V3 (n+ 17"? Aa)", (1.1) 


An inequality similar to (1.1) was proved by Mahler [6] in 1964. As we have seen 
after Lemma 1.2, it is sharp for n = 2. Sch6nhage [9] (see also Evertse [5]) proved 
that (1.1) is also sharp for n = 3. However, for n > 4, we do not know whether the 
exponent —n + | of H(q) in (1.1) can be replaced by a lateet number. Results of [3] 
show that it cannot be replaced by a quantity greater than —> — Tey 2 

Theorem 1.3 is an immediate consequence of Theorem 3.1, eee is stated and 
proved in Sect.3. We display below another direct consequence of Theorem 3.1, 


which includes Lemma 1.2 (up to the numerical constant). 


Theorem 1.4 Let a and B be distinct algebraic numbers of degree n > 2 and m, 
respectively. If m > 2n — 2, then, for any Galois conjugate a’ of a, distinct from a, 
we have 


la— Bl) > 2" (n+ 1)-"*!? (m+ 1)7””? min{1, |a — a’ |7!} H(a)~”" H(B). 
(1.2) 
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If H(B) > H(a)?"—)/", then, for any Galois conjugate a’ of ot, distinct from a, we 
have 


la — B| > 22 "(n+ 1°" (m+ 1)7”? min{1, ja — a |7'} H(w)” AB). 
(1.3) 


In the sequel, p always denotes a prime number and | - |, is the p-adic absolute 
value defined on the field Q,. 

The p-adic analogue of (1.1) was proved in [7], without an explicit numerical 
constant. An explicit estimate was obtained by Pejkovic [8], who established that 
the p-adic distance between two distinct conjugate algebraic numbers @ and a’ of 
degree n > 2 satisfies 


la —a'|p =n"? Aa)", (1.4) 


See [8] for a discussion on how sharp (1.4) can be. Evertse (private communication) 
recently proved that (1.4) is sharp for n = 3. 

The p-adic analogue of Lemma 1.2 was established in [4], where it was used to 
show that certain p-adic numbers cannot be very well approximated by quadratic 
p-adic numbers. 

We prove the p-adic analogue of Theorem 1.3. 


Theorem 1.5 Let p be a prime number. Let a and B be distinct algebraic numbers 
in Q,, of degree n > 2 and m, respectively. For any Galois conjugate a’ of a, distinct 
from a and satisfying |a — a’|, < 1, we have 


la — Blp = (m+ 1) "(n+ 1)? min{H(a)””, | — a" |5"} Ha) "H(B)”. 


Theorem 1.5 is an immediate consequence of Theorem 3.2, stated and proved in 
Sect. 3. 

The proofs of Theorems 1.3 and 1.5 are not difficult. They essentially follow the 
classical proof of Liouville’s inequality. It is our hope that Theorems 1.3 and 1.5 
have further applications beside those in [2, 4]. 


2 Auxiliary Results 


As mentioned in the introduction, the height of a polynomial P(X) with complex 
coefficients 


P(X) = a,X" +... + a,X + a9 = a,(X — a)... (X — an), 
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denoted by H(P), is the maximum of the moduli of its coefficients. If P(X) is non- 
zero, its Mahler measure is, by definition, the quantity 


M(P) == lanl | | maxt1., lai}. 


i=1 


The Mahler measure of a complex algebraic number a, denoted by M(q), is the 
Mahler measure of its minimal defining polynomial over Z. 


Lemma 2.1 Let a be a non-zero complex algebraic number of degree n. We have 
the inequalities 


2” H(a) < M(a) < Vn+ 1H (a). 


Proof This is an immediate consequence of Lemma A.2 of [1]. Oo 
For the proof of Theorem 1.5, we need an auxiliary result of Morrison [7]. 


Lemma 2.2 Let P(X) = a,X" +--+ aX + ao = a,(X — @))---(X — a) bea 


polynomial with coefficients in Q,. Then, for any non-empty set I © {1,...,n}, we 
have 
lanlp- I] lailp = _max  lailp (2.1) 


and, if P(X) is in Z[X] and & is in Q,, then 


lanlp = | [ 1€ — ailp < max{1, |&/7}. (2.2) 


ie] 


Proof See {7, p. 341]. Oo 


3 Proofs 


Theorems 1.3 and 1.4 are direct consequences of Theorem 3.1 below. 


Theorem 3.1 Let P(X) and Q(X) be non-constant integer polynomials of degree n 
and m, respectively. Denote by a a zero of P(X) of order s and by B a zero of Q(X) 
of order t. Assuming that P(B) 4 0, we have 


IP(B)| = (a+ 1 ami (m+ hye H(p)'-™/* H(Q)"" 
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and, if a2 # a is a Galois conjugate of a with |a — a| < 1, 


la = B| > gi-nls(n +4 {yaa (m+ jy ve 
xH(P)—"/ 9 H(Q)7"/“) (3.1) 
x min{|a _ a|—!, gn/(2s)-1/2 (n +4 (ye) (m a 1/9) 


H(py"/ 2) H(gy/@) y ; 


The first assertion of Theorem 3.1 immediately follows from the first assertion 
of Theorem A.1 in [1]. The second assertion is new. 

To derive Theorems 1.3 and 1.4 from Theorem 3.1, take for P(X) and Q(X) the 
minimal defining polynomials of a and 6 over Z, respectively. Theorem 1.3 easily 
follows since 2"~)/? (n+ 1)"/2-"/4 (m+ 1)"/4 H(B)"/? > 1. To prove Theorem 1.4, 
we need to use the upper bound 


la —a,|7! < 312 (n +4 1) PD? Aa)", (3.2) 
which is a restatement of (1.1). If m > 2n — 2, then 


gin-/2 (n+ yee (m+1)"/4 H(a)"/? > gin—D/2 (n+ ie (2n—1)"/4 H(a)""!, 
(3.3) 
Since the right-hand side of (3.3) exceeds 2-!37'/? (n + 1)@"*)/?? H(q)""!, 
inequality (1.2) follows from (3.1), (3.2) and Theorem A.1. 
If H(B) > H(a)?"-/", then 


ga-l/2 (n +4 yee (m +4 14 H(a)y"/? H(p)"/? > H(a)""! 


> (n 4 je |or =a el. 


by (3.2). Combined with (3.1) and Theorem A.1, this gives (1.3). 


Proof of Theorem 3.1 Set P(X) = an(X — a )*!... (XK — ap)” and O(X) = by (X — 
Bi)" ...(X — Ba), where a = a, B = Bi,s = 51,t = th, a, = 1, by > 1, and 
the a;’s (resp. the B;’s) are pairwise distinct. We also assume that a is a Galois 
conjugate of a satisfying 


la—a| <1. 


In particular, s = s; = sy. Denote by Q(X) = b(X — B;)...(X — Bq,) the minimal 
polynomial of 6 over Z, where b > 1. Since the resultant of P(X) and Q)(X) is a 
non-zero integer, we get 


1 < [Res(P,Q1)] = anb" la — |" a2 — Bl...Jay— Bl” [] IPCI. 


2Si<q 
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and, using that |P(6;)| < (n + 1) H(P)(max{1, |B;|})” fori = 2,...,¢1, we obtain 
M(Q1) ) 


max({1, [Bj 
(3.4) 


1 Sa,|a — Bl" - Jar — Bl”... |ap — Bl’? (n+ pene ( 


Observe that 
lo; — B| < 2max{1, |aj|}- max{1,|6|}, forj = 3,...,p. 


Combined with (3.4), this gives 


1 <2" *a,|a — Bl° - jon — Bl* - max{1, |B|}~7° I] max{1, |o;|}" 


35 j<p 


(n+ 1A (P)* MQ)". (3.5) 
We distinguish two cases. If |a2 — B| > 2|a — 8], then 


la —a| > Jan — | — la — p| > MEF 


and, by (3.5), 
la— pl => amd (o - aty| °M(P)! max{1, |a|}* max{1, |a@|}* max{1, | B|}?° 


(n+ NAP) MQ). 


Since q; < m/t, M(Qi) < M(Q)'", MQ) < Vm+TH(Q) and M(P) < 
Vn+ 1H(P), we get 


la — Bi > 2 Gy — a@2|~* max{1, |a|}° max{1, |a2|}° max{1, |BI}?° 


(n+ Iol1/2 mm + 17/09 H(P)-""H(Q)—""". 
(3.6) 


If |az — B| < 2|a — f|, then (3.5) implies 


1 <2” Saqla—B|*-max{1, |B|}-* | [ max{1, Joj|}-(2+ HP)" M(Q), 
j23 


and a similar computation gives 
la — B\?> > 27" FS max{1, Ja]}° max{1, |a|}° max{1, |B} }7° 
(n ne 1 4. 17/0 A (P)-"" HO)". (3.7) 


The combination of (3.6) and (3.7) proves the theorem. oO 
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Theorem 1.5 is a direct consequence of Theorem 3.2 below. 


Theorem 3.2 Let p be a prime number. Let a and B be distinct algebraic numbers 
in Q,, of degree n and m, respectively. For any Galois conjugate a’ of a, distinct 
from a, we have 


la —Blp = n+ IP + I"? Ha) "HB" 
ming lo —al'| "(m+ 1)"8(n + LHC)" (BY. 


Proof of Theorem 3.2 Let P(X) and Q(X) denote the minimal defining polynomials 
of a and f over Z. In view of (1.4), we assume that a and f are not conjugate. 
Consequently, the resultant Res(P, Q) is a non-zero integer. By using Hadamard’s 
inequality, it follows from the expression of the resultant as the determinant of a 
(m +n) X (m+n) matrix that 


|Res(P, Q)| < (m+ 1)"?(n + 1)"?H(P)"H(Q)". 
We deduce that 
|Res(P, Q)|, = (m+ 1)? (n+ I"? H(P) "(QO)". 


Writing P(X) = a(X — a)---(X — @,) and O(X) = b(X — B,)---(X — Bm) with 
a>1,b>1,a@=q, B = fi, this gives the inequality 


Jarlbl, TE lar— Bilp = m+ "P+ 1"? HP)" H(Q)". 
l<i<n,l<j<m 
Observe that, for j = 2,...,m, we have 
lalp |] loi — Bilp = |PCB)|p < max{1, |B)I7}. 
l<i<n 
by (2.2). Thus, we get 
lallol, |] lo—Blp J] maxtt,|Bilp} = (m+1)-"? (n+ 1"? (P) AO)”, 


1l<i<n 2<j<m 


and, noticing that |a;—B|, < max{1, |a;|,} max{1, |B|,} fori = 3,...,n, we deduce 
that 


|a|pla — Bp - jaz — Bly - max{1, |B|p}?- |] max{1, |ailp} 
3<i<n 


x |b, [T] max{1,|Bilp} = (m+ "P+ Do"? APY"). 


l<j<m 
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By (2.1) and the fact that P(X) and Q(X) have integer coefficients, we get that 


Blo}? 
> (m+ 17"? (n + 17"? AP)" (QO). 


Ja — Bl, - lors — Bly - max{1, Jor|p}~'-max{1, Jao|,}~! - max{1, 


If |a. — Bl, > |a — Bp, then |a — a|, = |a2 — BI, and 


-1 


p max{1, |o|,}-max{1, |o2|,}-max{1, IB lp}? x 


la — Bl, = |a — a 


x(m+ iW meeere,: cD Oia H(P) "H(Q)”. 
If ja. — Bl, = |a — B |p, then 


la — Bl; > max{1, |a|,}- max{1, |or2|,}- max{1, [Bi x 


x(m + 1)? (n + 1)-"? H(P)"H(Q) ”. 


The combination of the latter estimates yields the theorem. Oo 
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Guided by Schwarz’ Functions: A Walk 
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Peter Bundschuh and Keijo Vaananen 


Dedicated to the Memory of Professor Wolfgang Schwarz 


Abstract In this paper, transcendence results and, more generally, results on the 
algebraic independence of functions and their values are proved via Mahler’s 
analytic method. Here the key point is that the functions involved satisfy certain 
types of functional equations as Ga(z’) = Gy(z) — z/(1 — z) in the case of 
Gal2) = Yyso et /A — 2") for d € {2,3,4,...}. In 1967, these particular 
functions G,(z) were arithmetically studied by W. Schwarz using Thue—Siegel— 
Roth’s approximation method. 


Keywords Algebraic independence of functions * Mahler’s method 
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1 Introduction 


In 1967, Schwarz [14] studied the arithmetic nature of the particular Lambert series 


(oe) di 


Zz 
Ga(z) = = 7 a 


h=0 


at certain rational points of the unit interval, where d is an integer parameter. 
Using Thue-Siegel—Roth’s approximation theorem he proved: If d > 3,t > 2, and 
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0 <b <t'-*/@9, then Gq(b/t) is transcendental. Under weaker conditions on d and 
b, he obtained irrationality, non-quadracity, etc. Note that, e.g., the transcendence of 


a | 1 
pz a a 2-Ga(>). 


h=0 


the reciprocal sum of the Fermat numbers, was not covered by the results just 
quoted. 

Shortly later, in the first volume of the Journal of Number Theory, Mahler [9] 
roughly reported on earlier work of his on the transcendence of values of functions 
satisfying functional equations like 


Ga(z’) = Ga(z) — i (i. 


(see [6—-8]). In this note, Mahler suggested several directions, in which his work 
might possibly be extended. 

It is the main aim of this article to give an impression of the development, which 
was initiated by Schwarz’ note and Mahler’s reaction. To this purpose, we consider 
coprime! A,B € Q[z] \{0} satisfying A(O) = 0 and put H := 4. This rational 
function has coefficients in an algebraic number field (called K), and convergence 
radius A being +o for constant B, and the smallest absolute value of all zeros of B 
otherwise. Clearly, the series 


Ha(z) = YH") 
h=0 


with integer d > 2 (to be always assumed from now on) is in K|[z]], converges in 
|zZ| < min(1, A), and satisfies the functional equation 


Ha(z") = Halz) — HG) (1.2) 
“of Mahler-type.” Taking H(z) = z/(1 —z) we get Hg = Ga, the Schwarz functions, 


and on taking H(z) = z we get Hg = Fg, where the so-called Fredholm series 
Faz) = Vaso z"" is the prototype of a Mahler function. 


‘Here and in the sequel, Q denotes the field of all complex algebraic numbers. 
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2 Transcendence 


From Mahler’s transcendence criterion in [6] (see also [12, Theorem 1.2]) we 
deduce 


Theorem 2.1 Ha(q@) is transcendental for any a € Q with 0 < |a| < min(1, A) 
and B(a“) # 0 (j = 0,1,...) provided that Ha(z) is transcendental over K(z). 


Note here that something analytical in this direction is necessary: Namely, on 
taking H(z) = z/(1 — 2’) we find H2(z) = z/(1 —z); thus, a transcendence result of 
the above type cannot hold for this H. 

In some good-natured cases, the transcendence of H,(z) follows from classical 
results in function theory, e.g., if H(z) = z. But it is important to have rather 
general criteria for the transcendence of Mahler-type functions. One of those reads 
as follows: 


Theorem 2.2 ({11]) Iff € C[[z]] satisfies f(z“) = of (2) or f@) = W@F(z")) 


for some g, w € C(z, w), then f is either rational or transcendental. 


Thus, under favorable conditions, one has only to exclude rationality. Precisely 
in this direction, Coons [2, Theorem 2.2] deduced quite recently the following 
functional transcendence criterion from Theorem 2.2: 


Theorem 2.3 If f € C[[z]] satisfies f(z’) = f(z) — a with A,B € C[z] \{0} and 


max(deg A, deg B) < d —1, thenf is transcendental over C(z). 


Note that this implies the transcendence of all Schwarz functions Ga(z). Note 
also that, in general, the bound d — 1 for the degrees of A, B is best possible for any 
d as one can see by taking 


d—1| d—1 
Zz —-a 
A(z) = az-————-,, BQ) =a—z4 
Z—-a 


with a some (d — 1)th root of unity, where we are led to the rational function 


[oe 
ys") -— 
a B a-Z 


(see [3, Theorem 9] and [1, Lemma 2.10]) generalizing our example H2(z) after 
Theorem 2.1. 
In the rest of this section, we include a rather shortened 


Proof of Theorem 2.3 W.1.0.g. we may assume A, B coprime and, moreover, A(0) = 
0, B(O) # 0. By Theorem 2.2, it is enough to show f ¢ C(z). Thus, let us assume, 
on the contrary, that f is rational, say f = u/v with coprime u, v € C[z] \{0}. Then 
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the functional equation can be equivalently written as 


u(z")v(z)B(z) = v(z")(u(z)B(z) — v(z)A(2)). 
hence, by the coprimality of u(z“), v(z“), 


v(z4) | v(2)B(z) and u(z“)- rae = u(z)B(z) — v(zZ)A(z) (2.1) 


implying (d — 1) degu < deg B. 

Now, v = const would imply B|A, by the equation in (2.1), hence B = const 
since A,B are supposed to be coprime. Thus, u(z?) = u(z) — (v(0)/B(0))A(z) 
implying ddegu < max(degu,d — 1) hence u = const, and then A = 0, a 
contradiction. 

So the case deg B < d—2 is excluded [see after (2.1)] implying deg B = d—1, and 
this, in turn, leads to degu < 1, where we may immediately assume deg v = | or 
v(z) = az+ B with a F 0. The degree of the left-hand side of the equation in (2.1) 
equals d deg u, whereas the degree of the right-hand side is < max(d—1+degu,d), 
and this consideration yields degu < 1 or u(z) = az + b. Therefore the functional 
equation from Theorem 2.3 reduces to 


AG) Wet e 8 te Vb eae =) 
BQ) a+ B at +B (ac! + Yaz +B) 


with aB # ba since A # 0, whence 
A(z)(az(z! —1) + (az+ B))(az + B) = (ba -— ap)z(z"! — 1)B(z) 
implying degA + 1 =degB < degA = d —2 and, moreover, 
A(z)(az + B)? = Cz) -2(e4! — 1) (2.2) 
with some C € C[z] \{0} of degree 0, hence C(z) = const # 0. Thus, the right-hand 


side of (2.2) has only simple zeros but the left-hand side has multiple ones, and this 
contradiction concludes our proof. Oo 


3 Hypertranscendence 


An analytic function is called hypertranscendental if no finite collection of deriva- 
tives of the function is algebraically dependent over C(z). A hypertranscendence 
criterion for Mahler-type functions is the following: 
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Theorem 3.1 ({10, Theorem 3]) Suppose that f € C|[z|] has the following two 
properties: 


(i) For some integer n > 1, let f,f’,...,f~ be algebraically dependent over 
C(z). 

(ii) For some integer d > 2, f satisfies f(z4) = u(z)f(z) + v(2), where u,v 
C(z),u 4 0. If u(z) = syz@ +... with an integer M and sy # 0, put Q = 
[M/(d — 1)]. 


fa) 


Then there exists some w € C(z) satisfying 


uj (z)z@4 


u2(z) 


w(z4) = u(z)w(z) + v(z)_ or w(z4) = u(z)w(z) + v2) — y 


’ 


where u;(z) = u(z)/(smz),u2 € C(z) \ {0} fulfills the condition uy(z“) 
uy(z)/uy(z), and y € C is the constant term in the z-expansion of the quotient 
v(z)u2(z) / (uy (z)z2*) in case sy = 1,M = O(d—1), but y = 0 otherwise. 


II 


From this criterion we deduce the following consequence: 


Corollary 3.2 If f € C[[z]] \ C(z) satisfies f(z) — f(z) € C(2), then f is 


hypertranscendental. 


Proof Let f, as in the corollary, satisfy f(z“) — f(z) € C(z), = v(z) € C[fz]], say, 
hence v(0) = 0. Assume that (1) from Theorem 3.1 holds. Since (ii) holds also, with 
u(z) = | (hence M = 0,sy = 1,Q = O) and v(z) as above, there exists some 
w € C(z) satisfying 


w(z4) = w(z) + v(z) (3.1) 


(as soon as we have checked y = 0: u;(z) = 1;uz € C(z) \ {0} fulfills u2(z4) = 
u2(z), or U(z4)V(z) = U(z)V(z4) with uz = U/V, U, Ve C[z]\{0} coprime, whence 
uz = const € C%*, and thus y = 0). 

By (3.1), w(z) has no pole at 0, hence it is in C[[z]]. Since f satisfies also (3.1), 
g := f —w € C[[z]] satisfies p(z“) = g(z), whence g = const, and we arrive at the 
contradiction f = w + const € C(z). Oo 


In particular, all Gy(z) and F(z) are hypertranscendental. 
To see what this implies arithmetically, we quote the following algebraic 
independence criterion: 


Theorem 3.3 ([12, Theorem 4.2.1]) Let K be an algebraic number field. Suppose 
that f\,...,fn € K|[z]] converge in |z| < p with some 0 < p < 1, where they satisfy 


THO ah) =A AOscshO+' * Ci@incshey G2) 


with A € Matyxn(K(z)), b1,...50n € K(z), and t indicating the matrix transpose. 
Ifae Q with |a| < p is such that no a@ (j = 0,1,...) is a pole of by,...,b, and 
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the entries of A, then the following inequality holds for transcendence degrees 
trdeggQ(fi(@), ....fn(@)) = trdegg(K(z)(fi 2), --- .fn(2)). (3.3) 


We apply this with f(z) := f“~) (2), f being an irrational solution of 


f@)=f@-H®, H= . (3.4) 


with A, B as in our introduction [compare (1.2)]. According to Corollary 3.2, f is 
hypertranscendental, whence the right-hand side of (3.3) equals n for any n > 1. By 
successive differentiations of (3.4), we recognize 


AD FE) 25f" 9) =" Oral” "OVS" EG") 


with lower triangular A(z) € Matyx,(Z[z]) having 1,dz4—!,..., (dz!) 
on the main diagonal. From this, (3.2) can be easily checked with A(z) := 
A(z)!, *(bi(2),..-,bn(z)) = —A(z) - *(A(2),..., HP (2). Clearly, A has no 
poles # 0, whereas non-zero poles of b; can only come from zeros of B, and we 
have established the following result: 


Corollary 3.4 Suppose thatf is an irrational solution of (3.4), where A, B € Q{z] \ 
{0}. Then, for any a € on with |a| < min(1, A) and B(a*’) # 0 forj = 0,1,..., 
the numbers f (a), f’ (a), f” (a), ... are algebraically independent. 


In particular, for any d anda € Q, |a| < 1, the numbers Gz(a), G/(a), 
G'(a),... are algebraically independent. 


4 Algebraic Independence of the Values of Schwarz’ 
Functions 


The most typical question here is as follows. For a Mahler-type function f, analytic 
on D := {z € C: |z| < 1}, say, one is interested in necessary and sufficient 
conditions on a,...,a@, € Q MD such that f(a;),...,f(a,) are algebraically 
independent. This problem was essentially solved for the Fredholm series Fy, d > 
2, by Loxton and van der Poorten [5]. But in the case of Schwarz’ functions Gg, no 
significant results seem to exist in the literature, at least to our knowledge. We want 
to make here a modest contribution to this problem. 

To begin with, we conclude analogously to [12, pp. 106-107] as follows. 
Suppose 1,...,Q@, € Q M D to be multiplicatively independent, and consider 
the multivariable functions fj(z|,...,Z1) := Ga(z), j = 1,...,n. Since Gg(z) 
is transcendental over C(z), the functions f,(z),....fn(z) of the multivariable 
Z 1= (z,...,%) are algebraically independent over C(z). Moreover, 92. := 
diag(d,...,d) € Mat, ,(Zso) and @ := (@1,...,Q,) satisfy the properties (1) 
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through (IV) from [12, pp. 33-34, 62], whence we obtain the algebraic indepen- 
dence of Gyg(a1),...,Ga(a,) if, as assumed above, the numbers a,...,@, are 
multiplicatively independent. Note that the main feature here consists in the trick 
to deduce algebraic independence results for one single-valued function at different 
points from corresponding results on several multivariable functions at one point. 
In a very particular subcase of multiplicatively dependent a,..., a) € Q nD, 
we are in a position to give a characterization of the algebraic independence of 
Ga(a),..., Ga(@,), namely if all a;’s are powers of some a € on ND. 


Theorem 4.1 Let m,...,m, ben > 2 positive integers, and let a € on MD. Then 
Ga(a"™),..., Ga(a”") are algebraically independent if and only if 


i g dé (4.1) 
mj 


holds for any pair (i,j) with i F j. 


To prepare our proof below, we next quote the one-variable version of a main 
tool for algebraic independence of Mahler-type functions. 


Theorem 4.2 Let fi,...,f, € C[[z]] satisfy the functional equation 


Hi Orsay ANE occ)" Oi@)jerssba@) (4.2) 
with A € Matyxn(C) and by,..., bn € C(z). If fi, ...,fn are algebraically dependent 
over C(z), then there exist c,,...,Cyn €C, not all 0, such that cyfi+...+¢nfn € C2). 


Proof This is a particular case of [12, Theorem 3.2.2], a forerunner of which was 
[4, Theorem 2]. 


Proof of Theorem 4.1 We consider the functions 
fi@ = Ga") (G=1,...,n) (4.3) 


satisfying, by (1.1), 


fl) =f@ +55 (j=1,....7) (4.4) 


which is a system of functional equations of type (4.2). Iterating (4.4) we find 


sna oe 
fi) =4@+>> 
A=0 


dim; = 


for any integer € > 0, empty sums being 0, by convention. Assuming w.l.o.g. 
m2/m, = d°, this equation yields Gg(a’”"?) — Gy(a’") € Q, by (4.3), where the 
explicit value of this difference can be written down. Thus, the validity of (4.1) is 
necessary for the algebraic independence of Gg(a’"'),..., Gg(a’""). 
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That conditions (4.1) are sufficient for the algebraic independence of the 
numbers just mentioned can be deduced from Theorem 3.3 as follows: Apply- 
ing this theorem with K = Q,p = 1 and (4.4) as system (3.2), we obtain 
trdeggQ(fi (a),...,fn(@)) = n, by (3.3), hence our assertion, if we can show that 
our fi(z),....fn(z) are algebraically independent over Q(z). 

To prove even their algebraic independence over C(z), we use Theorem 4.2 which 
tells us the following. Iffi,...,,f, were algebraically dependent over C(z), then there 
exists ac := (c1,...,Cn) € C” \ {0} such that r(z) := cyfi(z) +... + eafr(Z) isa 
rational function satisfying the functional equation 


mj 


d : z 
= ;——_., 4.5 
r(c) = r(z) + aoe = (4.5) 
j=l 
by (4.4). 

To get our desired contradiction more conveniently, we next transform, for our 
above c # 0, Eq. (4.5) in r, m1, ..., mM, into an equivalent one ins, ki, ..., k,. To this 
purpose we write, forj = 1,...,n, mj = dD; with integers t(j) > 0 and kj > 0 
such that d t kj. Then condition (4.1) is equivalent to the distinctness of kj,..., kn. 
Moreover, with c and r as in (4.5), we define the rational function s by 

n=l atk 
s@) = 1@)— ia )) aE 
j=l t=0 
which satisfies 
d ee 4.6 
= + ———, 
s(z!) = s(2) 2, oe (4.6) 


Therefore, to reach our contradiction, it suffices to establish the following auxiliary 
result the proof of which we defer to the last section. Oo 


Lemma 4.3 If c := (cj,...,¢n) € C”" \ {0}, and k,...,k, are distinct positive 
integers not divisible by d, then the functional equation (4.6) has no rational solution 
S. 


This may be the right place to ask an 


Open Question Let a,...,Q@, € OQ. ND. Are the following two statements 
equivalent? 


(i) Gg(a1),..., Ga(Q,) are algebraically independent. 
(ii) aj F at! holds for any triple (i,j, £2) with i 4 j and £ > 0. 


Of course, the implication (1) = (ii) is easily seen. In the particular case a = a”! 
treated in Theorem 4.1, the reversed implication is also valid. 
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5 Proof of Lemma 4.3 


Step 1: Partial fraction decomposition. Assume on the contrary, that, for a 
(c1,..-,€n) ¥ 0, Eq. (4.6) with distinct k; not divisible by d has a rational 
solution s. Since all polynomials 2 — 1 divide z — 1, L := lem(k,,..., kn), 
it follows from [13, Lemma 1] that s must be of the shape 


a(z) 


ze] 


s(z) = 


with some a € C[z], a € 0, by (4.6) and the distinctness of the k;. Note also 
dega < L. 


For integers k > 0, we write C; := e?”'/* and have 


= i 1 
—_ = 1+4+- -=l|14+- Z), 


-) 
Si lk 


the second sum being over all positive divisors 6 of k, where we put 


5-1 j 
A@= > ; ae 
j=0 2 Ss 


Here the poles of fs are exactly the primitive 6th roots of unity, i.e., the roots of the 
6th cyclotomic polynomial. 
In the same way, we may write down the partial fraction decomposition of s as 


6-1 


s(z) =S+ Y s5(2) with 53(z) = > 58 7 


SIL j=0 %~ 55 


where S and the s3;’s are complex constants. 
By the preceding notations, our functional equation (4.6) assumes the form 


Y (se) - 5@) = OF KO. (5.1) 


IL iV 7 blk; 


where we already used c} +... + c, = 0, a result coming from the fact that both 


sides of (5.1) tend to 0 as z > oo. W.Lo.g., let us assume c)-.. «+ Cin A O but Cn41 = 
... = C, = 0 for 2 < m <n, and furthermore kj >... > k,,. Next, consider the set 
of all positive integers 6 dividing at least one of k,..., km, where none of these 6’s 


is divisible by d since we assumed d + k; forj = 1,...,n. Clearly k; is the maximal 
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element of this set and cannot occur among the divisors of k2,...,km. Thus, all 
poles of f,, remain poles of the right-hand side of (5.1), and we may summarize the 
result of Step | as follows: Denoting by N the greatest positive integer such that the 
left-hand side 


>> (ss) - s8@) (5.2) 


3iL 


of (5.1) and fy have at least one pole in common, then N = k; holds, whence d } N. 


Step 2: Study of ss(z“) and final contradiction. From the above definition of s3 for 
positive divisors 6 of L we obtain 


5-1 é=1 


dy _ Sof ——————— 
ss(Z) = 2 a a », wg cor : FEROS (3.3) 
ae =>; re G=te ) 
j= j= 


Suppose, from now on, pe ast oe to be the canonical factorization of d. Assume 


that p1,..., Po are not divisors of 6 but po+1,..., Dw are, where we have to consider 
the cases 0 = 0,...,@. Then we have the following equivalence: 


(7,6) =1 — > G+ «b,6) = (j+ «6d, d5/T_, VS 


with the usual convention here and later that empty products equal 1. Now, any 
positive divisor D of p, -...-+ Do is relatively prime to 6, whence there are precisely 
4 numbers x € {0,..., d — 1} satisfying D|(j + «6). Thus, by the well-known 
inclusion—exclusion principle, we can say that, for fixed coprime j, 6, the number of 
k € {0,...,d—1} such that j+«6é is prime to py -. . ..pg (or equivalently to []7_, po) 
equals d[[7_,(1 — 1/p;). Therefore we can note ° that, for fixed coprime j, 6, there 
are exactly d[]?_,(1 — 1/p;) values x € {0,...,d — 1} such that (j + «6,d6) = 1 


holds. Hence we conclude 


di—1 


53(z4) = d a + 55(z) (5.4) 


(j.d3)=1 


from the double sum in (5.3), where, strictly speaking, we should write 35 j-[)/3]s 
in the numerator instead of simply s3;. The rational function »; in (5.4) vanishes 
identically in case 0 = 0, whereas, in the cases | < o < a, it may have poles at 
certain primitive pth roots of unity but with p < dé only. Since ss 4 0 is equivalent 
to the fact that not all ss ;, 7 € {0,...,d5 — 1} and prime to dé, vanish, we conclude 
from (5.4) that, in this case of 6, the difference s3(z“) — ss(z) has poles at (d8)th 
roots of unity. Thus, the NV defined before (5.2) must be of the form dé, whence d|N 
holds, and we have obtained a contradiction proving Lemma 4.3. 
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Sums of Two Squares and a Power 
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Dedicated to the memory of Wolfgang Schwarz, with admiration 
for his broad interests, inside and outside mathematics. 


Abstract We extend results of Jagy and Kaplansky and the present authors and 
show that for all k > 3 there are infinitely many positive integers n, which cannot 
be written as x7 + y* + z‘ = n for positive integers x, y, z, where fork 4 0 mod 4a 
congruence condition is imposed on z. These examples are of interest as there is no 
congruence obstruction itself for the representation of these n. This way we provide 
a new family of counterexamples to the Hasse principle or strong approximation. 
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Looking at Schwarz’s own PhD thesis, see [11, 12], which is on sums of prime 
powers, i.e. on the Goldbach—Waring problem, one finds a great number of results, 
one of those being the following (Theorem 3 of [12]): 

For fixed k > 1 let S,(N) be the set of positive integers n, with 


3<n<QN, 

n #0 mod 2, n € 2 mod 3, for oddk 

n = 3 mod 24, for even k, 

n#0Omod 5, for k = 2 mod 4 

n # 0,2 mod 5, for k = 0 mod 4 

n# 1modp, for each p = 3 mod 4 with (p — 1) | k. 


Then the number of integers n € S;(N) not of the form 


n=pit+p3+D}, 


0 N 
(sam) : 


This improved on a result of Hua [8, Theorem 1], who proved this with B = oe AS 
we had worked earlier on solutions of x? + y? + z* = n, it is due to this connection 
that we have chosen to contribute the present note to the volume in Memory of 
Wolfgang Schwarz. 

As it turns out, also one of the first named author’s PhD advisors worked on this 
kind of problem in his PhD thesis: without restricting the variables to primes, one 
should be able to obtain stronger results, and indeed, improving on earlier work 
pioneered by Davenport and Heilbronn [3] and further developed by many other 
authors, Briidern [1] has shown that there are at most O(N Inpte) integers n < N 
with no solutions of 


is, for all B > O, at most 


n=xrtyte, (1.1) 


where 7 is not in a residue class excluded by congruence obstructions. For a survey 
of results on sums of mixed powers, see also [2] and [14]. 

It was generally expected that for all sufficiently large n the Hasse principle for 
Eq. (1.1) holds true, i.e. for all such n satisfying the necessary congruence conditions 
there would exist a solution of (1.1) in positive integers, see, for example, Chap. 8 
in [13]. However, in 1995 Jagy and Kaplansky [9] shattered this belief by proving 
that for k = 9 and some positive constant c there are at least cS positive integers 
n < N that are not sums of two squares and one kth power. In fact, their method 
works for any odd composite number k, but not for the other cases of k. In [4] we 
proved that a similar restriction holds for k = 4. That approach actually generalizes 
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to all k divisible by four (see Theorem 3.1), and by slightly modifying it we can not 
only get a bigger set of exceptional n but we can also handle k not divisible by four; 
to be more specific, we prove that (1.1) does not satisfy ‘strong approximation’: For 
k = 2 (mod 4), k > 6and sufficiently large N we show that there are asymptotically 
> N'/?/(logN)!/? positive integers n < N for which Eq. (1.1) has no solution with 
z fixed into a certain residue class, though there are no congruence obstructions (see 
Theorem 3.2). For odd k > 3 we show that there are asymptotically at least ste 
such exceptional positive integers n < N (see Theorem 2.1). 

Let us further mention that Hooley [7] investigated sums of three squares and a 
kth power, Friedlander and Wooley [5] sums of two squares and three biquadrates, 
and Wooley [15] sums of squares and a ‘micro square’, in connection with a 
conjecture of Linnik. In this connection we would like to add a seemingly forgotten 
old reference: Theorem 7 of Rieger [10] states that the number of integers n < N 
which can be written as n = x7 + a + z* where z < F(N), and F is a function 
tending monotonically to infinity, with F(n) < ./logN, is >i = in other 
words, as good as it can be. 

The authors are grateful to Tim Browning, Jérg Briidern, Roger Heath-Brown, 
Jan-Christoph Schlage-Puchta, Dasheng Wei and Trevor Wooley for interesting 
discussions or observations. 


2 Two Squares and an Odd kth Power 


Theorem 2.1 Let k > 3 be odd. Let p be a prime with p = 1 mod 4k. Then there are 
no integers x, y, z, positive or negative, with x? + y* + z = p* and z = 2k mod 4k. 


Proof Assume there are solutions, then x? + y? = (p — z)(pk"! + pk?z4 +++ + 
pe? + 2-1). If z = 2k mod 4k, then p — z = 2k + 1 mod 4k. Since k is odd, 
2k + 1 = 3 mod 4. Hence p — z must contain a prime divisor g = 3 mod 4 with odd 
multiplicity. Note that gcd(q, k) = 1, as otherwise g|k and 0 = p—z=2k+1= 
1 mod gq gives a contradiction. 

Recall that by the general classification of integers which are sums of two squares 
the integer x? + y* contains prime factors g = 3 mod 4 with even multiplicity only. 
Therefore both p — z and p*! + pz +... + pz? + 2! are divisible by g. With 
p =zmod q it follows that 


pe + pe2gt e+ pe? 4 41! = ke! = 0 mod g. 


This implies that g | z and hence g | p, which is impossible, as gq = p would 
contradict p = 1 mod 4. 

Also note that there are no congruence obstructions that would imply that in 
x? + y? + 2k = p* there are no solutions with z = 2k mod 4k. 

To see this first observe that for a fixed odd prime q one can choose an integer 
z = 2k mod 4k such that q is coprime to p* — z‘; similarly, for g = 8 just choose 
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z = 2k. For this fixed z the congruence x? + y* + z* = p* mod q has a nonsingular 
solution in x and y which by Hensel’s lemma can be lifted to a g-adic or 2-adic 
solution, respectively. 


By the prime number theorem in arithmetic progressions, the number of such 
examples, p* < N with p = 1 mod 4k, is asymptotically 


i [- dt k NYE 
(4k) Jo = logt =. 2@(k) logN’ 


3 Two Squares and an Even kth Power 


3.1 Two Squares and a kth Power, k = 0 mod 4 


Theorem 3.1 Suppose that 4 | k and let p be a prime with p = 7 mod 8. Let 
n= 1 mod 8 be either | or consist of prime factors congruent to | mod 4 only, and 
assume that n < p. Then there are no positive integers x,y,z with x° + y? + # = 
(np)’. 


Proof Let k = 2t, where t is even. Assume there are solutions, then x? + y? = (np— 
z')(np +z’). If zis even, then np—z' = 3 mod 4. If zis odd, then np—z' = 6 mod 8. 
In both cases np—z' must contain a prime divisor g = 3 mod 4 with odd multiplicity. 
Therefore, as in the proof of Theorem 2.1, we conclude that both np — z' and np + 2 
are divisible by g. Hence their sum 2np and their difference —2z’ are also divisible 
by q. Since 2n ¥ 0 mod q, and since p is prime: p = q, and since z # 0: q divides 
z. But this gives a contradiction: 


ety tz > gq > q* > (ng) = (np). 


Let us give an estimate of the number of integers np < N, with n = 1 mod 8 
consisting of prime factors 1 mod 4 only, and < p. 

Recall that by a theorem of Landau the number of integers n < N consisting of 
prime factors 1 mod 4 only is of order of magnitude gust?! and about one half of 
these numbers satisfy the congruence restriction n = | mod 8. 

Let f : N — {0, 1} be the characteristic function of these integers n, i.e. we put 
f(a) = 1, ifn = 1 mod 8, and all prime factors of n are 1 mod 4; otherwise, we put 
f(n) = 0. Now 


7 N/p au 
df= 2, I> Dd Tawny? > Tog?’ 


np<N,n<p n<N/p,n<p N!/2<p<N3/4 
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where we used that 


1 
-= log log N*/4 —loglogN'/? + o(1) = log(3/2) + o(1) > 1. 
N!/2<p<Nn3/4 


(In view of Landau’s theorem this order is the right order of magnitude.) Hence the 
number of exceptional (np)? < N provided by Theorem 3.1 is >> TE 


Note that as for Theorem 2.1 one can check that there are no congruence obstruc- 
tions for the representation of (np). 


3.2. Two Squares and a kth Power, k = 2 mod 4 


Theorem 3.2 Suppose that k = 2 mod 4, k > 6 and let p be a prime with p = 
7 mod 8. Let n < p be an integer either | or consisting of prime factors congruent 
to 1 mod 4 only, andn = 1 mod 8. Then there are no positive integers x, y,z, where 
2|z withx? +y? +2 = (np). 


Proof The proof is almost verbatim as above. 


Let us remark that as above one shows that the number of exceptional (np)? < 
N provided by Theorem 3.2 is > roan Further note that in a similar way as 
for Theorem 2.1 one observes that there are no congruence obstructions for the 
requested representation of (np). 


4 Afterthought 


A major part of the paper was actually written around 2007/2008. We had shown 
earlier versions of this paper to several colleagues, hoping that someone would write 
a more detailed explanation based on tools from arithmetic geometry such as the 
Brauer—Manin obstruction. Indeed, in this way the question has come to Fabian 
Gundlach [6] who was very recently able to give a detailed and general account. 

As Gundlach refers to our work as an unpublished manuscript, and as our proofs 
use a much less sophisticated language, it seems desirable to have this paper in final 
form. The main part of this paper is a slightly improved version, compared to the 
manuscript Gundlach referred to. In particular, the version cited by Gundlach [6] 
had in Theorem 2.1 the same statement and proof with p~* rather than p*. Also in 
Theorems 3.1 and 3.2 we now have an additional factor n, thanks to an observation 
of J.C. Schlage-Puchta. In other words, the current version gives slightly stronger 
results. 
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Abstract Mean value theorems for multiplicative arithmetic functions are applied 
to demonstrate uniformity of sign changes in the Fourier coefficients of automorphic 
forms. 
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1 Introduction and Background 


Wolfgang Schwarz’ serious interest in the theory of multiplicative functions is well 
illustrated in the wonderfully wideranging 1994 volume Arithmetical Functions 
that, together with Jiirgen Spilker, he wrote for the London Mathematical Society 
Lecture Note Series [9]. 

Classical multiplicative functions are usually supported on almost all the primes. 
In the present paper I consider multiplicative functions that may have scattered 
values and slender support, and apply them to establish uniform distribution of sign 
changes in real Fourier coefficients of automorphic cusp forms. 


P.D.T.A. Elliott (>) 
Department of Mathematics, University of Colorado Boulder, Boulder, CO 80309-0395, USA 
e-mail: pdtae @euclid.colorado.edu 


© Springer International Publishing Switzerland 2016 109 
J. Sander et al. (eds.), From Arithmetic to Zeta-Functions, 
DOI 10.1007/978-3-319-28203-9_8 


110 P.D.T.A. Elliott 


We begin with the following result of Elliott and Kish [4]. 


Theorem 1 Let 3/2 < Y < x. Let g be a complex-valued multiplicative function 
that for positive constants B, c, c, satisfies |g(p)| < B, 


Y> (ig(pPl-e)p =a. Yswsx, 


w<psx 


on the primes. Suppose, further, that the series 


Yils@lq dogg)’, «=1+cB(e+ By", 
q 


taken over the prime-powers q = p* with k = 2, converges. 


Then with 
me 3 Y> (1g(p)| — Re g(p)p") p'. 
~  Y<p<x 
di a(n) « xlogxy' T ] (1 + Ig(p)Ip“!) (exp(—Acle + B)!) + T-”) 
n<x psx 


uniformly for Y, x, T > 0, the implied constant depending at most upon B, c, Cc, and 
a bound for the sum of the series over prime-powers. 


Remark Since the partial sums ae ee ([g(p)| — Re g(p)p") p~! are continuous in 
t and non-decreasing in x, their divergence for each real ¢ ensures that 


EO) =0 (soe I] (1+ a) , x20. 


nsx psx 


Specialising: 


Theorem 2 Let g be a real-valued multiplicative function that satisfies the condi- 
tions on g(p) in Theorem I for Y = 3/2 and all x > 2, and for which the series 


» (g(p)| — (Pp) p diverges. 
Then the asymptotic estimate in the above remark holds. 


Further specialising, we obtain a result that I shall directly employ. 


Theorem 3 Let the multiplicative function g assume values 0, 1, —1, and with 
positive constants c, c, satisfy 


>> (s(p)|-Op! = -«1, 


u<psv 
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uniformly for 2 < u < v. Assume, further, that the series }~ p—!, taken over the 
primes for which g(p) = —1, diverges. 
Then 


Yo a(n) = 0 (soe []@+ cle") , X00. 


n<x psx 


Remark Integration by parts shows the lower bound hypothesis on |g(p)| in Theo- 
rem |, likewise in Theorems 2 and 3, to be amply satisfied if )~ nex (OCP) p 'logp = 
clogx — co for some cz > O and all x > 2. 


Proof of Theorem 1 A complete proof of Theorem | may be found in Elliott and 
Kish [3, Theorem 2]. 

It is shown in Elliott and Kish [3, Lemma 14], that for a certain constant c3 and t 
real, 


2 log log(2 + |#/) +3, |¢llog2 > 1, 
Re > ors — log |¢| + ¢3, (log x)! < |t| < (log2)7, 


2<p<x 


loglogx + c3, |t}logx < 1, 


uniformly for x > 2. 
Denote this upper bound by 6(¢) and let L = >, yee p_'. To deduce Theorem 2 
we shall apply the following result, a special case of Elliott and Kish [3, Lemma 15]. 


Lemma 1 /f for some 6, 0 <6 < 1, t real with h(p) in the interval (0, 1], 


> A(p)p7'|1— p"? < 81, 


2<p<x 


then 


S> A(p)p! < 4817 + 30(28- 71) + 30(0). 


2<p<x 


Proof of Theorem 2 Assume that for a non-zero real t the series )> (|g(p)| — Re g(p) 
p")p! converges. For z in the complex unit disc |1 — zi" < 2(1 — Rez), and the 
partial sums }°,_,.<, |g(p)||1 — p"|’, x = 2, are uniformly bounded. 

We may apply Lemma | with h(p) = B7'|g(p)| and 46!/3 = c/2. In view of 
our lower bound hypothesis on the values of |g(p)|, loglogx « 0(267!/31) + @(t) 
which, for all sufficiently large values of x, is untenable. 

We may now apply Theorem 1. 

We may appreciate Theorem 2 by recalling two results of Wirsing [10]. 
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Satz 1.1 Let A(n) be a non-negative real-valued multiplicative function that for a 
positive t satisfies 


1 
y. aL e ~tlogx, x00. 


psx 


Let the A(p) be uniformly bounded and the series )~ 4(q)q~', taken over the prime- 
powers q = p* with k > 2, converge. Moreover, if t < 1 then, with the same 
convention, let >, —.A(q) « x(logx)~! hold. 


qx 
Then 
ev x Mp) Ap) 
oe (Se ee ; 
sy ” ron re aa 


where y denotes Euler’s constant. 


Satz 1.2.2 Let A(n) satisfy the conditions of Satz 1.1, and let g(n) be a real-valued 
multiplicative function that satisfies |g(n)| < A(n) on the positive integers. 
Then 


—1 [o.e) [o.e) —1 
Jim Dos) (x 20») =|| (: + Seton) ( +>) are") 


n<x n<x Pp k=1 k=1 


where the product either converges properly to a non-zero limit, or improperly to 
zero. 


If, for 0 < a < f, we define the multiplicative function 4 to be a, 6 on the 
primes in alternate intervals [exp(2*), exp(2‘t!)], k = —1,0,1,2,..., and to be 
zero on all other prime-powers, then, as x increases through values exp(2”), 


(logx)~' 5° p~'g(p) log p 


psx 


approaches (a + 28)/3 or (6B + 2a)/3 according to whether m is confined to odd 
or to even integers. 

Satz 1.1 and Satz 1.2.2 are then not applicable. 

None-the-less, it is clear from Theorem 2 that for the multiplicative function g 
defined to be —a@ on those primes for which A(p) = a, and to coincide with A on 
the remaining prime-powers, the conclusion of Satz 1.2.2 still holds. We do not need 
an asymptotic estimate for the mean of A(n), only an appropriate lower bound; and 
that can be provided by the following basic result. 


Lemma 2 Let g be a non-zero, non-negative real multiplicative function, uniformly 
bounded on the primes and for which the series ¥~ g(q)q_', taken over the prime- 
powers q = p* with k = 2, converges. 
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Assume, further, that for a positive d and all sufficiently large x, 


Lipex 8(p) logp = dx. 
Then >, <, g(n) lies between positive constant multiples of 


mE 


psx k=1 


n<x 


logx 


uniformly for x > 2. 


Proof of Lemma 2 c.f. Elliott and Kish [4, Lemmas 19, 20 and 21]. 

With t = 1, Satz 1.2.2 of Wirsing settled in the affirmative a conjecture, ascribed 
to Wintner and Erdos, that for a multiplicative function g that assumes values 0, 1, 
—1, the limit limy.0x7' ©), -, g(n) always exists. Of particular interest was the 
case when g is Mobius’ function jz since, as was already known to Landau at the 
beginning of the twentieth century, its validity led rapidly, and by elementary steps, 
to a proof of the rational prime number theorem. 

Supposing the conclusion of Theorem 2 or 3 to be valid, it is not clear what more 
might be needed to guarantee even a weak estimate }~ a le(p)|p_ | logp ~ tlogx, 
x — oo, on the primes. 


2 Application 


Let 


f =) anQny)'?K,(2n|nly)er"™ 
n#0 


be a Maass cusp form, a newform for the action of the congruence group I(\) 
upon the complex upper half-plane H with trivial character, an eigenvector of the 
appropriate Hecke operators and normalised to have aj = 1; a solution to the 


Laplacian equation 
of of 1 
{oF , es \)_ fe _ yp 
* (as tas) = G-)! 


that belongs to the Hilbert space L7(I)(N)\H) with Petersson inner product. 

We identify f with a cusp form attached to an irreducible cuspidal automorphic 
representation of GL2(Ag), Ag the rational adéles. We assume the representation 
to have trivial central character and not to be of dihedral, tetrahedral or octahedral 
type, c.f. Kim and Shahidi [7]. 

Let M be a finite collection of places including the archimedean and the non- 
archimedean whose local representation is ramified. Let D be the product of the 
corresponding rational primes. I shall deem the rational primes not dividing D to be 
unramified. 
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For k = 1,2,..., denote by L(s, sym‘) the L-function attached to the extension 
of the representation onto symmetric tensors of rank k, and by Ly(s, sym‘) its Euler 
product restricted to unramified primes. 

Let ye b,n *, for Re(s) sufficiently large, be one of the restricted L-functions, 
Lu (s, sym‘), k = 1,2,3. Our assumptions guarantee the coefficients b,, indeed the 
corresponding coefficients for every k > 1, to be real. Denote by B(x) the sum 


Dscapto 1. 
Remark If N = 1, then D = 1, 


CO 
Ly(s, sym!) = Yo ayn 
n=1 
and all the coefficients a, are real. 


Theorem 4 There are positive constants c4, dj < 1, dz, such that B(x) = 
c4x(log x)“ moreover 


Bix)! SY) 1=1/2+ 0 ((logx)”) 


n<x,bn<0 


uniformly for x = 2. 
The positive and negative coefficients b, are uniformly distributed amongst those 
that do not vanish. 


The arguments being similar, I confine myself to the case of L(s, sym’). 
The following result of Heilbronn and Landau [5, 6], will be very helpful. 


Lemma 3 Suppose that the Dirichlet series 


lo.) 
F(s) = ane en 
1 


n= 


with A, < Ax < +++, An > ©&, converges for Re(s) > 1 and that a, > 0. Assume 
that F(s) — (s — 1)~! is regular at s = 1. Then 


> ane" = y+ O(). 


An <y 


The proof of Theorem 4 requires a number of preliminary results. Consider the 
restricted Euler product 


ta(s.sym') = TT (1 app)" (1 Boy" 


(p.D)=1 
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where a, + f, is real, a»6, = 1. A central issue is the generalised Ramanujan— 


Petersson conjecture that |a,| = |B,| = 1, c.f. Kim and Shahidi, loc. cit.; but I shall 
arrange matters to avoid explicit appeal to the size of the individual parameters a, 
Bp. 


In what follows q will denote a prime-power, go the prime of which it is a power, 
(qo, D) = 1. 

On the unramified primes define functions p(p*) = ak + e, v(p*) = a + 
ak + - + Bk = 1,2,.... Since the polynomials defining p, y are symmetric 
in a, Bp, they are also polynomials in a, + £,, @,8, and, after our assumptions on 
the coefficients of Lyy(s, sym'), real. Note that for unramified primes p, y(p) is the 
coefficient of p~* in Ly(s, sym*), but w(p?) will not in general be the coefficient of 
p °°. Likewise, p(p) is the coefficient of p~ in Ly(s, sym'), but p(p*) need not be 
the coefficient of p~*’. 


Lemma 4 


Y> p(q)’q7' log go = 14 logx + O(1), 


qx 


>> wig)’ | log qo = logx + O(1), 


qsx 


uniformly for x = 2. 


Proof By direct calculation 


h(s)=exp{ Dd) p(p)*(kp)" 


p.k,(p,D)=1 
= Ly(s, sym®) Ly (s, sym®)’Ly (s, sym*)° Lig (s, sym?)78 Emu(s) ee 


In view of the work of Kim and Shahidi, loc. cit., each of the restricted symmetric 
product L-functions is analytic in the half-plane Re(s) > 1, analytic and non-zero 
in a proper disc |s — 1| < c; so, therefore, is h(s)¢(s)~'*. For a suitable choice of 
constant K, log (Kh(s)¢(s)~'*), with principal value of the logarithm, is analytic in a 
similar region. Differentiating and appealing to standard properties of the Riemann 
zeta function, we obtain about the points s = 1 a Laurent expansion 


Y- p(q)8q7* log qo = 14(8— 17! + co +o 
q 


Since the coefficients p(q)* log qo are real and non-negative, we may appeal to 
Lemma 3 with the A, running through the log gq, to justify the first assertion of 
Lemma 4. 
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To justify the second assertion we likewise consider the function 


exp} > w(p')? (kp)! 


p.k,(p,D)=1 
= Ly(s, sym? x sym?) = Ly(s, sym®)Ly(s, sym*)Ly(s, sym?) fy(s). 
Lemma 5 Fora positive constant c and all sufficiently large values of x, the interval 


[2, x] contains at least cx/ log.x primes on which w(p) does not vanish. 


Proof Appeal to the second estimate in Lemma 4 and an application of Hélder’s 
inequality shows that 


log(x/y) +O) = > w(q)’q"! log go 


Y<qSx 
1/4 3/4 
= >> ‘log go ( YS Ww@e?eq? ea) ; 
Y<qEx.W(Q#0 Y<qEx 
uniformly for 2 < y < x. 
By direct calculation w(q) = _ p(q) (p(q)" - 2). Moreover, for real x, 


(x(x? — ay < max(2°3~7,x°). The second of the two majorising sums does 
not exceed 


Y~ (0(q)® + 2) q7! log qo 


y<qsx 


which, by the first estimate of Lemma 4, is 16 log(x/y) + O(1). 
We choose y = ex and fix ¢ at a value sufficiently small that for all large x the 
first of the two majorising sums exceeds 2~!? log(1/e) = 5 > 0. Then 


Y 1  edx(logx)™! 


ex<q<x,w(q)40 
and the assertion of Lemma 5 follows rapidly. 


Lemma 6 For a positive constant cs, 


Y> pp! > csloglogx + O(1), 
p<x.(p)<0 


uniformly for x > 2. 
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Proof Again appealing to the representation W(q) = p(q) (p(q)" - 2), we see that 
if w(q)? = r° > 1, then |p(q)| > r. Hence 


Yo ¥@?q" log go = Y) W(g@)q"" log go—r D> (OW) | log qo. 


gex.lw@lsr6 qx qx 


Moreover, (p(q)W(q))” < p(q)? max(2, p(q)®) < max(3, p(q)’). Two appeals to 
Lemma 4 then show that for r2 > 17, 


Y> |w(qlq7! log go = 7° (1 = 17r) logx + O(1). 


gsx.|v(g)|<r6 
With r = 3/2, 38 - 24c6 = 1, 


ba IW(glq| log go = co logx + O(1) 


qx 


and, after an integration by parts, 


> Wk) = cologlogx + O(1), 


pk<x,(p,D)=1 


uniformly for x > 2. 
Note that, together with an application of the Cauchy—Schwarz inequality, the 
second estimate of Lemma 4 shows that 


De lW(q)|q_' log qo < logx + O(1). 


qsx 


We shall employ this bound to offset the variability in the sign of w(q). 
We may modify the argument of Lemma 4, replacing w(q)? by (q), and without 
differentiation obtain a Laurent expansion 


S- v(pS&*) t= ete, [sl <e. 


(pk D)=1 


To this end we note that Ly(s, sym?) is analytic and non-zero in a proper neighbour- 
hood of s = 1. In fact, the existence of a zero of order m at s = 1 would require the 
addition of a term —mlog(s — 1) to the sum and increase the value of the constant 
cs in the present lemma. 

With oy = 1 + (log x)~!, an integration by parts shows 


(logx) So |wiglg™ loggo hence =) |w(p')|(kp*)! 


q>x pk>x,(p,D)=1 
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to be bounded uniformly for x > 2. Moreover, 


Yo We =p) « dogs) S35 [Wig log qo 


pk<x,(p,D)=1 qex 


is also uniformly bounded. So, therefore, is the sum 


Yo wp). 


pe<x,(p,D)=1 


Altogether, setting w(p*) = k!, 


28=2 SY) |w@lq wa) 


qSxW(p)<0 


> V(V@|+ V¥@) a 'w@ = cologlogx + O(1). 


qx 


The terms with |¥(q)| > t > 0 contribute to $ at most 7! D7)... |W(@I?q"'w@) < 
t| loglogx + O(1). Fort > 2c;!, 


2 SY) g'w(q) = (cot! — 2) log logx + O(1). 
qSx,~(q)<0 


Choosing tcs = 4, 


16 > p | > cgloglogx + O(1). 
psx,W(p)<0 


Lemma 6 is established. 


Proof of Theorem 4 for the Coefficients of L(s, sym*) This now rapid. Define the 
multiplicative function g by assigning to g(p*) the value 1,—1 or 0 according 
to whether 5, is positive, negative or zero, respectively. That B(x) lies between 
positive constant multiples of x(logx)~! Thee (1 + lg(p) |p!) for x > 2 follows 
from Lemmas 5 and 2, the explicit lower bound from a second application of 
Lemma 5. 

That 


B(x)! 0) ->0, x7 @, 


nsx 


and the negative coefficients b, are uniformly distributed amongst those that are 
non-zero follows from Lemma 5, Lemma 6 and Theorem 3. 
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To gain the sharper estimate of Theorem 4 we give a quantitative aspect to the 
remark following the statement of Theorem 1. 

Let t be a value in the interval [—(logx)*, (logx)*] for which the sum 
ee p'(\g(p)| — Re g(p)p”) attains its minimum. For z in the complex unit 
dise |1 — 2?|? < 4]1 —z|? < 8(1 — Rez). We apply Lemma 1 with h(p) = |g(p)|, 
z= g(p)p", 86'/3 = c. There are two cases. 

In the first case the function A in the statement of Theorem |, with T = (log x)’, 
satisfies 8A > 5 0) <P 

In the second case (c/2) loglogx « 36(457!/31) + 30(2t) + 1. The definition of 
6 ensures that |t| < (log x)~® for a positive constant cg < | and all large x. Setting 
w = exp((log x)®) we note that 


Sop || =p"| « It] 9p logp « |t|logw « 1. 


psw pw 


Hence A > yak (lee) — g(p)) + OC), which Lemma 6 shows to exceed 
2cscg log logx + O(1). 

In either case A > co log log x + O(1) for a positive co and all x > 2. Theorem 4 
is established. 


Remarks For the purposes of establishing uniformity of sign changes in the 
coefficients b,, the behaviour of b,« on powers of finitely many primes is largely 
irrelevant. It should, perhaps, be emphasised that we do not need an accurate 
estimate for the number of non-vanishing coefficients b,; a positive lower density 
on the primes suffices. 

A version of Theorem 4 for L(s, sym!) and Maass forms attached to the action of 
SL(2, Z) on the complex upper half-plane is given in Elliott and Kish [4]. 

In view of the work of Barnet-Lamb, Geraghty, Harris and Taylor [1], analogues 
of the sharpened Theorem 4 may be established for symmetric product L-functions, 
of every order, that are attached to holomorphic forms for the action of a congruence 
group on the complex upper half-plane, provided the form is not of complex multi- 
plication type. Whilst the error terms in the estimates each save a power of log x, they 
do not currently have an explicit uniformity in the order of the symmetric product. 

An important feature of the present result is the availability of generalisations 
corresponding to the replacement of the group GL2(Ag) by GL,(AFr), with Ar 
the adéles of a finite algebraic extension, F, of the rationals. With this generality 
in mind, to establish the present Theorem 4 the argument given in Elliott and 
Kish [4], needed modification since appropriate analytic properties of the function 
Ve A(n)in~™, effectively of symmetric product L-functions of order 10 and 12, 


n=1 
are not currently available. 


Concluding Remark The value distribution function 


F(y) = fy! >» 1, x>1,yeR, 


nsx,f(n)<y 
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attached to an additive arithmetic function n — f(n), may be studied through the 
agency of its Fourier—Stieltjes transform 


[ e dF(y) = [x Sgn), teR, 


oo n<x 


where n > g(n) = exp(itf(n)) is multiplicative. Multiplicative functions play a 
central réle in probabilistic number theory. 

The characteristically catholic survey of probabilistic number theory given by 
Wolfgang in Kanazawa, Japan, 2005 [8], illustrates an aspect of his personality that, 
quite apart from his mathematics, I shall very much miss: his generosity of spirit. 
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On Error Sum Functions for Approximations 
with Arithmetic Conditions 


Carsten Elsner 


Dedicated to the memory of Professor Wolfgang Schwarz 


Abstract Let €)(@) = >> i tte |¢m& — Pm| be error sum functions formed by 
convergents Pm /Gm (m >= 0) of areal number @ satisfying the arithmetical condition 
dm = 1 (mod k) with 0 < 1 < k. The functions €; are Riemann-integrable on [0, 1], 
so that the integrals i Ex.(a@) da exist as the arithmetical means of the functions 
E;,,; on [0, 1]. We express these integrals by multiple sums on rational terms and 
prove upper and lower bounds. In the case when / vanishes (i.e. k divides g,,) and 
when the smallest prime divisor p; of k = p{'p5’---p;' satisfies p; > k* for some 
positive real number ¢, we have found an asymptotic expansion in terms of k, namely 
is Exo(a) da = €(2) (2¢(3)k2) + O(3'k-*~*). This result includes all integers k 
which are of the form k = p* for primes p and integers a > 1. 


Keywords Approximation with arithmetical conditions * Continued fractions 
Convergents * Error sum functions 


2010 Mathematics subject classification: Primary 11J04; Secondary 11J70 


1 Introduction 


There are many results in the literature concerned with rational approximations 
p/gq to irrational numbers, where p and g are restricted by additional arithmetical 
conditions. An important result in this direction is due to Uchiyama [11]. 


Theorem A For every real irrational number a and integers s > 1,a > 0,b>0 
such that a and b are not simultaneously divisible by s, there are infinitely many 
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integers p and q # 0 satisfying 


2 


a-— p=a (mods), q=b (mods). 


Pp S 
Ss 
| qi 4@ 
In [3] the author proved that the constant 1/4 in Uchiyama’s paper cannot be 
improved. Let ||7|| denote the distance of a real number 7 from the nearest integer. 
Then we deduce the following corollary from Theorem A: 


Corollary Al Let f : N > Rso be a function satisfying f(q) = 0(q) for positive 
integers q tending to infinity. Then, for every integers s > 0, a = 0 and every real 
irrational number a we have 


liminf f(q)||qo|| = 0. 
q>0 
q =a (mod s) 


In particular cases stronger results are possible, e.g., for the number e = exp(1) 
by Theorem 1.3 in [4]. 


Theorem B Let a and s be arbitrary positive integers. Then 


liminf q||ge|| = 0. 
q>0 


q =a (mod s) 


About 5 years later Komatsu [9, Theorem 4] showed that the result of Theorem B 
remains true for e replaced by every number e!/* (k € N). 
Recently, the author [5] studied the so-called error sum functions. Let 


E@) = Yo ldm&—Pml, — E*(@) = Ym — Pm), 
m=0 


m=0 


where for m > 0 the fraction pm/gm is the m-th convergent of the real number a. 
The numbers p,, and qg,, can be computed recursively from the continued fraction 
expansion of a. Various aspects of these functions have been investigated in [5-7], 
among them it is shown that 0 < E(a) < (1 + J/5)/2 and 0 < E*(a) < 1 for 
all real numbers w. Both, €(~) and €*(a), measure the average of error terms for 
diophantine approximations of a by rationals. Moreover, E€(@) € Q(a@) holds for 
real numbers of algebraic degree | and 2. For e = exp(1) we have the formula 


I 
E(e) = 2e | exp(—?’) dt—e = 1.3418751..., 
0 


which proves that E(e) ¢ Q(e). The function E(q@) is continuous for every real 
irrational point a, and discontinuous for all rational numbers a (see [6, Theorem 2]). 
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Therefore, the function € is Riemann-integrable on [0, 1]. It turns out [6, Theorem 5] 
that 


: 5  3£(2)log2 
/ E(a)da = — —“3@) = 0.79778798 ... , 


where ¢ denotes the Riemann Zeta function. This integral represents the arithmetical 
mean of the function € on [0, 1]. This result can be generalized. Letn = 1,2,3,... 


and 
1 
| 1 
0 


It can be shown [6, Sec. 4] that 


co 


n 
| dno —Pm| da. 
0 


m>= 


; — 1 2¢m+1,-1) 
= Qrtl (n+. 2) Ni 


where 


(1) oO 1 m2—1 —] my 
S(n+1,-1) = nl > ntti » 


m 
m2>m,>0 mM m=1 : 2 m=1 : 


is known as multiple Zeta function. Borwein et al. [2] expressed €(n + 1,—1) in 
terms of log 2, €(2), €(3),...,€( + 2). Thus we obtain the following results: 


Theorem C Let n> 1 be an integer. Then we have 


1 @« 
‘ 1 1 4log(2) (n + 1) ! 
= da = ——({1- ct ea Rr renee yrerer-<ah (a 
[ y |amox Pm| a aed ( =a) + (n+ 1)€(n 4+ 2) gn+l 


m=0 


l n—-1 1 1 
+ooperp L(-#) (1- ar) e+ Deen +10. 


k=1 


In particular, for n = 2,3, 4 we have the identities 


— 2, _ Tlog263) (2) 17 _ 
[ a dmQ — Pm da = ~ 6C(4) + 12¢(4) = 74 = 0,39813... ; 
12 3, _ Ilog2g(4)  3¢(2)63) 49 
fy % lana Pm eee GEG) ek 
10 2 
[ Yin eo TSG) SEO) TE ne aanay 
0 — 


a0c(6)  40¢(6) * 80¢(6) 160 
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Taking ¢(2s) € Q(z) into account, it follows that 


LEQ 
heQ 
1,€Q 
I4EQ 
Is € Q 
Is € Q 


, log(2), £(3)) 

, log(2), £(3)) 

x, log(2), 6(3),€(5)), 

x, log(2), 6(3),€(5)) 

m, log(2), 6(3),6(5),€(7)), 
x, log(2), (3). €(5),€(7)); 


CONN NBN SN | 


in particular we know lh,...,J6 € Q(x, log(2), €(3), €(5), £(7)). This proves that 
Ii,..., 16 are algebraically dependent over Q. But indeed a stronger result holds, 
which can be verified using a suitable computer algebra system. 


Corollary C1 The numbers [,, In, 13,14 are algebraically dependent over Q. For 
x; = 1; @ = 1,2,3, 4) the algebraic equation 
0 = 10240x,x3x4 — 2976x7x2 - 1488x7 — 5952x1x2 + 5120x1x3 + 7840x1 x4 
—2592x0x3 + 6400x3x4 + 944x, — 4542x. + 1904x3 + 4900x4 + 179 


holds. 


The proof works by substituting the above expressions for I, 12,15, 14 into the 
equation given in the corollary, where additionally 


ca x4 wr 

27=—, 4 = —, 6) = — 

(==. Mar. =e 

must be taken into account. 
Note that 
h,...,bn € Q(x, log(2), €(3), €(5),...,€(2n + 1)). 
RENE, nnn” 
n+2 


This proves 


Corollary C2 For every integern > 3 anyn+3 numbers from the set {I,, In, ... lon} 
are algebraically dependent over Q. 


In this paper we focus our interest on a generalized error sum function. Let a be 
areal number, and let k > 1 and 0 </ < k be integers. We define 


(oe) 


Ex (ot) = » lq — P| ; 
m=0 
dm = | (mod k) 
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in particular we set €,(a@) = Exo(q@). It is clear that €;(@) = E(a), and 


k-1 
Ye) = €@) (k= 1). 
l=0 


For k > 1 the error sum function €(a@) can be transformed into a more striking 
form. Since k does not divide go = 1, the term for m = 0 in €(q@) does not occur. 
Moreover, for the convergents pn/Gm (m > 1) of @ satisfying gn = 0 (mod k) we 
obtain the inequalities 


| | < ; < i < i 
me Din|\ SS oe ee 
_ i dm k 2 
This proves 
lo.) 
Ex@) = D> llama] = &> 1. (1.1) 
m=1 
Klam 


We continue to point out more basic properties of €,(a@) for k > 1. Since qm and 
m+1 are coprime, at most every second term in €;%(@) does not vanish. So we obtain 
the following upper bound for &(a): 


ea) = > + _ Ea) -E*(@) 

(a) = oe ldm&—Pm| < y., 1d2m+1¢—Pam+i| = — 1a (k> 1). 
m= 1 m=0 
kldm 


The identities 


AC Na a ans 


Ex(a) = 5 


with k > 1 hold for all numbers @ given by their continued fraction expansion 


a = [0:k, 1,4, 1,4,1,...] = [0:%, 1] = + 


1 1 1 
4k 2” 


I 


since for m > 0 we have the congruence relations g2m4+; = 0 (mod k) and goy = 
(mod k). Moreover, 


foe) oo 5 
Ex(a) = S |gam410 — Pam = x a (=) au = 


R 


m=0 m=0 
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There exist real irrational numbers @ for which the series €;,(a) consists of at most 
finitely many terms, contrary to the series €(a). To prove the existence of such 
an irrational number, we define a recursively by its continued fraction expansion 
a = [0; a), a7,...] = [0;2,1,1,2,2,4,6,...] as follows. We have 


a=2, @=3, g=5, ga=13, g=31, gqo=137, gq = 853. 


Now let us assume that for m > 8 the denominators g,,—; and gm—2 are primes. Then, 
by the Dirichlet prime number theorem, there are infinitely many positive integers 
a such that gn = dqm—1 + Gm—2 € P. The number a,, is uniquely defined by the 
smallest positive integer a satisfying this condition. Then, for every integer k > 1, 
the series E,(a@) consists of at most one term. Furthermore, there are many situations 
in which €;(a@) vanishes. 


Proposition 1.1 For every integer k > 1 there are uncountably many irrational 
numbers a such that Ex(a) = 0. 


To prove this proposition, let k > 1 be any integer. We define an irrational number 
a depending on k and on a sequence (b,,),,>> of positive integers by 
a = [0;1,kb2,kb3, kb4,...] = [03 a1, a2, a3,...] . 
The denominators g,, of the convergents P»,/qdm of a satisfy the recurrence formula 
go=l, g=a=1,  dn+2=4n42dmt1 + 9m (m=0,1,2,...). 


Since gm+2 = Gm (mod k) form = 0,1,2,... it follows recursively that 1 = qo = 
Q2 = 44 = +++ = Qom (mod &) and, similarly, 1 = gq) = q3 = ds = +++ = Qom41 
(mod k) form = 0,1,2,.... This proves that no denominator q,, is divisible by 
k. Hence, €(a) = 0. By Cantor’s counting principle we have found uncountably 
many real numbers @ satisfying €,(a@) = 0. 


The main goal of this paper is to study the behaviour of the numbers [; . Ex(a) dat 
depending on & and /. For / = 0 and k restricted to those numbers having no small 
prime divisors we prove the asymptotic behaviour of these integrals for k tending 
to infinity (Theorem 2.1 and Corollaries 2.2—2.5). For integers k having many small 
prime divisors the numbers i E;(a) da tend more quickly to zero than in the case 
k = p* for fixed a > | and primes p (Theorem 2.6 and Corollary 2.4). The integrals 
on the error sum functions €;; with / > 0 are treated in Theorem 2.7. 


On Error Sum Functions 127 


2 Statement of the Results 


Let w : N > {-1, 0, 1} be the Mobius function, and let €(s) = )°°, 1/n' for s > 2 
be the Riemann Zeta function. By J3 : N — N we denote Jordan’s arithmetical 
function defined by J3(1) = 1 and 


I(n) = eT] (1- =) (n> 1), (2.1) 
pin 


where p runs through all prime divisors of n. Moreover, for any integer n let D, 
denote the set of all positive divisors of n. For every positive integer r we define the 
number T,. by 


a) DR ern ee) 
n=r 1<m<n/r 


The identity from the following theorem can be considered as the main result 
of this paper, which contrasts with the property of the function €(a@) given by 
Proposition 1.1. 


Theorem 2.1 For every integer k > 1 we have 


; 3, ad! Ms) e(ks/1)T, 
[, fleoae = mm 2 sls) 


Corollary 2.2 Let k > 1 be any integer having t prime divisors , where P denotes 
the smallest prime divisor of k. Then we have 


_ £2) es 


Corollary 2.3 For all primes p we have 


i E,(a)da = 


and 


eae 3¢(2) log2 ie ¢(2) o(4) 


C3) 2&3). 4) 2k B)p? 


TIp? 
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Corollary 2.4 Let p be a prime and a be a positive integer. Set k := p*. Then we 
have 


§(2) 1 
i Ex (a) da = 2B)e + Oar) 5 


Corollary 2.5 Letk > 1 be an integer having at most t prime divisors. The smallest 
prime divisor P of k satisfies P > k° for any 0 < € < 1. Then we have 


§(2) of 3 ). 


1 
fede = se + Ogee 


To state the results in the subsequent theorems we need Euler’s totient g. 


Theorem 2.6 For every integer k => 3 we have 


1 < 9h) &(2) 
Plogione = ae < ff exteyac <e 


For the numbers k = p\p2-::p; given by the product on the first r => 2 primes 
P| = 2, po = 3,... we have 


1 


1 
Ex(a)da <x ————_.. 
/ ae k? log logk 


Theorem 2.6 shows that ue Ex (a) da =< k~* does not hold for k € N tending to 
infinity. In the following theorem we estimate the integral on the error sum function 
Ex(a) for 1 > 0, where the case / = 1 is treated separately. By (a, b) we denote the 
greatest common divisor of two integers a and b. 


Theorem 2.7 (i) For every integer k > 2 andl = 1 we have 


1 
3420 < | Ex, (a) da = 3 x 


8 4(k +1) ne ae 
a=1(modk) (a,b) =1 


(ii) For integers k => 3 and2 <1<k we have 


gl) (2) 
AP ah nenaer > 3 Saag 


aa 
a=1 b=0 
a=I(modk) (a,b) =1 
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For two consecutive primes p,—; and p, (r > 2) it follows from (ii) in 
Theorem 2.7 by Bertrand’s Postulate and Theorem 9 in [8] that 


1 


Plog? r- 


: 1 
: Ep, pp (a) da 7D = 


3 Auxiliary Results 


Lemma 3.1 Let k > 1 be an integer, and let r be any positive divisor of k. Then we 
have the identity 


eG) Ly HS) Ks/r) 
Xu a (3) Xu I3(ks/r) 
(d,k) =k/r 


Proof We obtain 


= (d) = (mk/r at (s) (mk /r 
ee a 


d=1 m=1 (mk/r) m=1 s>]1 
(d,k) =k/r (m,r) = 1 s|(m, r) 
re = L(mk/r) r\3 wo Ls) Sa nks/r) 
-Jow y P-gp see 
k m k i ‘ m 
s>l m=1 S' 5 m= 


s|r m = 0 (mod s) 


For any positive integer tf we have 


= a no por | 
2s = y ~ TOA 


m=1 m=1 
(m,t) = 1 


where the identity on the right-hand side can be obtained by using the method 
explained in [10]. Substituting the last expression into S by setting t = ks/r, we 
complete the proof of the desired identity from the lemma. Oo 
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Lemma 3.2 For every positive integer r we have 


oo nr—l 1 3 l 
jie —.. , T; = ~¢(2)log2 — —¢(3), 
Dera T= HBO? 30) 
Ui oghudts 
£(2) I 1 men 
T, = or +O(5). and oy < T, < 
i. rere 


In particular, we have T, < 2/r for allr > 1. 


Proof Let r = 1 be an integer. To prove the alternative expression of T,, we first 
observe that 


oo nr—l lee) {ea 1 lose) 1 1 
* Pave et ae an 3.1 
> aw m) oe y ke ar eee at ah) 


where the last identity follows by interchanging the order of summation, and where 
|7| denotes the floor function, i.e. the greatest integer not exceeding 7. Next, let 
B = 1 bea real number, and 


fees eae |B] = 1 (mod 2), 
~  (0,if [B| =0 (mod 2). 


Then we have 


LB] 1 LB/2|+8 1 16/2] 1 
ae Siren oor 
n=1 m=1 m=1 
which yields, equivalently, 
LB i os 1 7 , 2! 4 > 1 
no 2m—1 2 m n 
n=([B/2|+1 m=1 m=1 n=1 
2 eo | 1, 2! 4 
Ode i. 2a 
LB] m 
oa ain 
aot 


m=1 
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With 6 = k/r for k => r we conclude from (3.1) and (2.2) that 


oo nr—l (= Th a 
= 1. 
2 Becre, ee + my” 3 tm 


For the asymptotic expansion of 7, we apply Euler’s summation formula to the 
function f(x) = 1/(x + nr — 1)?: Let B(n) = n — [n] — 1/2. Then, 


Le 1 nr nr 
i = [fears [" Bey’) ax— Bay) —Beangar) 
ar (x + nr—1) 1 1 
1 1 1 
~ nr Qnr—I laa) 
which yields 
lo e) 1 nr 1 
C= y= 
dX u m=1 (w+ a Ds 


2 
=< ae + saomcp) + (a) 


Se Pe eae) 
= 52 sof 


T, is a special case of the multivariate zeta function ¢(m, n), see [1, Sect. 2.6]: 
co n—-l 


co on er {= as oo 
n= Dy Oe aye ores 


n=1m=1 n=1m=1 


(= —_ 


3 3 
= -£2,-1) + 168) = F6(2)log2- 7603). 
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The bounds for T, stated in the lemma follow from (2.2) by using the inequalities 


1 _yyrtl 

2 23 65. 

2 m 

l<m<n/r 
es) 
i © dt 1 2 

Ya</ —= <- (r=2), T) = 1A09757 +++ <2, 
m=r me r—l P? r-—1l r 


4 Proof of Theorem 2.1 


Let x~1 : N > {0, 1} be defined by 


lifn=/ (modh), 


Kein) = O otherwise . 


Note that y;(1) = 1 holds if and only if / = 1. At the beginning of the proof of 
Theorem 2.1 we follow the lines in Sect. 4 in [6] and modify the arguments. Let m 
and a),...,dm be positive integers. We define the rational numbers &, & by their 
continued fraction expansion: 


& = [0; a1,...,Am—1, Gm] and & = [0; a1, ..-,Am—1,4m + 1] : 


We have & < & for even m and & < & for odd m. We define the interval [,, 
by In = (&,&) for even m and I, = (&,&)) otherwise. It is well known that 
the intervals J, are disjoint for different positive integers aj,...,@ , and that for 
any fixed m the union of all closed intervals 7, gives the interval [0, 1]. For this 
decomposition of [0, 1] we express the integral as follows: 


10 
/ ~ (—1)" XK.1(Gm) (Go — Pm) da 


m=0 


1 
i Exs(a) da 
0 


1 i 1 
= za ) 4 ~~ (-1)” : XK.i(dm) (Gn — Pm) da 


m=1 


= il) i, 3 (-1)" _ = / Xk1(Gn)(Gm&% — Pm) da 


m=1 a=1 dm=1" 


loo) fo) 


lore) & 
= BEE E weston [Gane —pad de 


m=la,;=1 aQn=1 
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Note that p,, and g, depend on aj,...,@m,. The continued fraction expansion 
of every point a € J, has the form a = [0;q),...,@m—1,4m,...]. Hence the 
convergents p,/qy for v < m depend on J,,, but not on a € J,,. Therefore we 
compute the above integral on [&,, &] by 


& = 
J (ant — Pn) ds = (6 — §) S24 Sn = PP 


2 
Using 
Pm (an + 1)Pm-1 + Pm—2 
& =— and & = ————— 
dm (4m + 1)4m—1 + Im—2 


we compute the expressions 


—1)" 
a rn oe 
(Gm + 4m—1) 4m 
and 
Pm-19m + dm—1Pm + 2Pm4m 
& + & a he oe, ee = en 
(Gm + Gm—1)4n 
which give 


1 


7) 
(Gm — Pm) da = ——_—————_.. , 
i 24m(Gin + Gai)? 


and consequently 


fe ea Xe) a: 3 3 Xk(Gn) (4.1) 
(a) da = ——— vee —_——_... 
0 2 m=1a,;=1 an=1 24m (Gm + qm—1)” 


For the denominators of two subsequent convergents of the continued fraction of 
a = (0;a),...,@m,...) itis well known that (qm, @m—1) = 1. For fixed qm = a we 
count the solutions of q,—, = b with (a,b) = 1 and 0 < b < a— 1 in the multiple 
sum on the left-hand side of (4.1). It is necessary to distinguish the cases m > 2 and 
m=1. 


Case 1: m-> 2. First let aj = 1. Then, 


dm-1 
dm 


= (0; dm,..., 2, 1) = (05 dm... 42+ 1). 
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For a; > 2 we have 


dm—-1 
dm 


= (03 dm,.-.,42,41) = (0; Gn,...,€2,a,; —1,1). 


Case 2: m= 1. Fora, = 1 we have a unique representation of the fraction 


m— 1 1 
oo = Pea = (0:1), 
dm q a\ 1 


since the integer part dj) = O must not be changed. For a; > 2 there are again 
two representations: 


mm 1 
fe = ihe arm el 


dm 71 eal 


Therefore it becomes clear that for any fixed g,, = a every coprime integer b with 
0 < b < a—1 occurs exactly two times in the multiple sum on the right-hand side 
of (4.1), except for m = 1 and a; = 1. For this exceptional case we separate the 
term 


Xe) Xk) 


2q1(q1 + 40)” 8 


from the multiple sum. Therefore we obtain 


1 
/ Eq(tx) de 
0 


l oO 600 lo) ‘: ioe) l 
= Ha Xn) > er Xxi(Gn) ad sy Xx (qi) a Xe) 
m=2a,;=1 am=1 24m(4m =i Ym-1) a;=2 2q(M1 + 9) 8 


= se x XK ia) _ 


=e a(a +b)? 
(a, i =1 
a-—1 
3xK10) 
= ——— + (4.2) 
8 by Xu aa + by 


a=I/(modk) (a,b) =1 
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Note that for b = 0 the condition (a,0) = 1 holds for a = 1 only. For the proof 
of Theorem 2.1 we now assume that / = 0, so that 7,;(1) vanishes. Then (4.2) 
simplifies to 


a-1 


i ae ant (4.3) 


a=1 b=0 
kla_ (a,b) =1 


Next, we express the arithmetic condition (a,b) = 1 ona and b from the inner sum 
by the Mobius function. Then we proceed by interchanging the order of the resulting 
triple sum. Here, [d, k] denotes the least common multiple of d and k. 


1 wd) [o-e) [o.0) a-1 u(d) 
Ex(a)da = — ee 
I EEL wer LL Late 
ka d|(a,b) [d,k]|a d\b 
([d,k]n—1)/d 
= 3 3 ye pd) 
= ———— 
d=1 n=1 m=0 [d, k\n([d, k|n + dm) 
[d,k]n/d-1 
* pd) 


II 


CO 
(d,k) = k/r 


The condition (d, k) = k/r implies that 


Mist 224 
tl aa a 


Hence the above multiple sum takes the form 


oo nr—l 


1 : i(d) 
/ Ex(a) da = Xu x er rer nrd(nrd + md)” 
(d,k) =k/r 
2S y eOys5 
7 XA ar (ee ore a 
(d,k) = k/r 


Finally, we express the two terms in brackets by the identities given in Lemma 3.1 
and Lemma 3.2, respectively. This completes the proof of the theorem. O 
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5 Proofs of Corollaries 2.2—2.5 


Proof of Corollary 2.2 From the multiple sum in Theorem 2.1 we separate the term 


forr=kands=1: 


(s)u(ks/ rT, 
[ eae “ar tah ere ay 


D, 


ae 

- | 2(s) u(ks/r)|T,, 

- sat 103: 2 asta) 
a 

= B+ DL latomsini ts) 
reDseD, 
r#kVs#1 


Here we have applied the inequalities T, « 1/r (Lemma 3.2) and 


h(n) = “TT (1-5) =r T1G-5) = a (n> 1) 


pin 


(5.1) 


(5.2) 


(see [8, Theorem 280]). In order to estimate k*s*/r we discuss the following two 
cases. Recall that r|k and s|r, and that the number P is the smallest prime divisor of k. 


Casel: 1l<r<kands=1. 


Case2: 1<r<kands>P. 


3 3 3 3 
Sf 2 a PP > Pe. 
: 


Using these bounds we estimate the error term in (5.1). This gives 


1 
[ &layae = FE +O( TL mormtss/1 ze) 


reD, seD, 
rekvsFl 


~ aa 7 (ng » » \we(s)qe(ks/r)]) 


r€D, sED, 


= Ee + Cg) + (pe HD momen. 


ré€D,; s€D, 
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where we have applied the asymptotic formula for 7; from Lemma 3.2. To complete 
the proof of the corollary we finally prove the identity 


a= D> Yo lM(swlks/)| = 3 (k= pt-pt) 


r€D, sED, 


by induction with respect to tf. Fort = 1 let k = p*. We count three pairs 
[r, s] € Dy x D, such that |w(s)u(ks/r)| = 1 given by [p“, 1], [p*, p], and [p*!, 1]. 
Now we assume that g,’ = 3‘ holds for all integers k’ having ¢ prime divisors. Let 
k = pi'---pr'piiy! and k’ = pi! ---p;". While [r’,s’] € Dy x Dy runs through 
all 3‘ pairs which are counted for gy, we obtain g, by counting the 3 - 3’ pairs 
[r,s] € Dy x D, given by [r’pit'.s'], (pit! s’p4i], and Peo all. This 
completes the proof of Corollary 2.2. O 


Proof of Corollary 2.3 Fork = p € P the multiple sum in Theorem 2.1 consists 
of three terms corresponding to [r = 1,5 = 1], [r = p,s = 1], and [r = p, s = p]. 
Therefore, we obtain 


: 1 7 Ty iG 
CAC ae eee ee 
i padde = oe (-s +S +a) 


With J3(p) = p? — 1 and 
T| = 5¢(2)1 2 £3) 
ee eke 


given in Lemma 3.2 we prove the identity stated in the corollary. Now, for p > 3 we 
have with Lemma 3.2: 


: 1 : 3¢(2)log2 1 97 
[ playaa ee ee 
0 p?—1\(p— 1) 2¢(3) a 109p 
and 
1 
Pp 3¢(2)log2 1 2 
E da > —~ - —_+-] > i 
fede > Gy Ra ta) > ae 
This proves the inequalities stated in Corollary 2.3. Oo 


Proofs of Corollaries 2.4 and 2.5 We apply Corollary 2.2 with k = p*,t = 1, 
and P = p. Then the error term in Corollary 2.2 takes the form 


t 
o(h+ &) = (b+) =o ae) = ohn) 


Thus Corollary 2.4 is proven. Also Corollary 2.5 follows immediately from 
Corollary 2.2. O 
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6 Proofs of Theorems 2.6 and 2.7 


Proof of Theorem 2.6 For the upper bound we estimate the right-hand side of the 
identity in (4.3). Let k > 2. Then 


i’ lee) km—1 1 co km—1 1 lee) 1 ¢(2) 

Ex(a)da = Y) D) et) eo = ie 2 

0 m=1 b=1 km(km a b) m=1 b=1 k’m m=1 km k 
(km, b) = 1 


On the other hand, the lower bound for the integral in Theorem 2.6 follows from 
the identity (4.3), too. Here we assume that k > 3. 

= 1 (k) (k) I 
Y= EE oe 


1 
E a)da > — SS: 
i He) aes kk+by ~ kQk—-ly 46 ~ Bloglogk 


b=1 
(k,b) =1 
(6.1) 


The inequality on the right-hand side involving a lower bound of Euler’s totient 
follows from Theorem 328 in [8]. 
Next, let k = pip2---p, for some positive integer r > 2. Then we have 


logk = > logp = V(p,) < py 


PSDP 
peP 


by Theorem 414 in [8]. Applying additionally Theorem 429 in [8], we obtain 


1 1 1 1 
I] (i=) = I cs) “ise aia (6.2) 
po nee 
Moreover, 
1 loo) km—1 1 1 oo (k ) 
[eas Y ger e le 


m=1 boi m=1 


———— by (6.2)). 
< Pieslork (by (6.2)) 


Together with the lower bound (6.1) we complete the proof of the theorem. O 


On Error Sum Functions 139 


Proof of Theorem 2.7 By (4.2) we have already shown the identities in the 
theorem. So it remains to prove the inequalities. First, we prove the upper bounds. 


ie = 1 3x4 (1) 
» Xu a(a 4 b)? a 8 


a=1 b= 
a=I(modk) (a,b) =1 


_ 3 ai ee GE, 0700 0) 
= km + Dm $14 bP 8 
(km + 1,b) =1 
; ee 1 — 3 xu) _ ds. 3 1 3xxa) 
7 m=0 b=0 (km 7 I 8 P m=1 (km yy : 
1 3x01) _ (2) 3xei(1) 
7 a> ae 8 =2 Te 8 
5 €(2) 
= +> ifl=1 
5 WA) 
a tiS ifi>1 


For the lower bounds we treat the cases / = 1 and/ > 1 separately. First, let / = 1. 
Then 


lo.e) a-1 1 3 a(t) 
». 2 aoe 


a=1 b=0 aa + b) 
a=1(modk) (a,b) =1 
way fy (km + Akin +1 +b) 8 
(km + 1,b) = 1 


lee) km 
1 


5 
~ gt 2s d (km + 1)(km + 1 +b) 


(km+1,b)=1 


5 1 
= 3° 2 (k+ Ik +1+ bY 


b=1 
(k+1,b)=1 
k+1 
5 1 5 k+1 
8 i 4&+0) 8 4(k+ 1) 


(k+1,b)=1 
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Next, let / > 1. Then 


> = 1 — Hi) 


b 
a=1 =0 a(a+ by + , 
a=! (mod k) mer =1 


lee) km+l-1 1 


= 2 pe (km + D(km + 1-4 bY 


m=0 =] 


(km : 1,b)=1 
i-1 l 
l l gl) 

> ————_ > —_ = —_,, 
= a U1 ai by » 4B 4B 

b=1 b=1 

(Ib) =1 (,b) =1 
This completes the proof of the theorem. Oo 
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Sum of the Lerch Zeta-Function over Nontrivial 
Zeros of the Dirichlet L-Function 


Ramiunas GarunkStis and Justas Kalpokas 


Dedicated to the memory of Professor Wolfgang Schwarz 


Abstract For 0 < a < 1 and0 <A < 1, A rational, we consider the sum of values 
of the Lerch zeta-function L(A, a@,s) taken at the nontrivial zeros of the Dirichlet 
L-function L(s, x), where y mod Q, Q > 1, is a primitive Dirichlet character. 


Keywords Dirichlet L-function ¢ Value-distribution 


2010 Mathematics subject classification: Primary 11M06; Secondary 11M26 


1 Introduction 


Let s = o + it denote a complex variable. We use the notation e(x) = exp(2zix). 
By {x}, (a,b), and [a,b] we denote the fractional part of the real number x, the 
greatest common divisor of integers a, b, and the least common multiple of integers 
a, b, respectively. In this paper T always tends to plus infinity and ¢ is any positive 
number. 

The Lerch zeta-function is defined by 


S 60 
L(A, a,8) = ae (o > 1), 


n=0 
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where 0 < A < 1 and0O <a < 1. The Dirichlet L-function is defined by 


Ls.) = 22 >, 


n=1 


where y(n) is a Dirichlet character modulo some positive integer Q. For y mod 1 
we get the Riemann zeta-function L(s,y) = ¢(s). The yet unsolved generalized 
Riemann hypothesis (GRH) states that inside the critical strip 0 < o < 1 every 
Dirichlet L-function has zeros only on the critical line o = 5. Zeros in the critical 
strip are called nontrivial and we denote them by py = f, + iy. In view of the 
functional equation (see formula (2.4) below) the nontrivial zeros are symmetrically 
distributed with respect to the critical line. A Dirichlet character y mod Q is said 
to be primitive if it is not induced by any other character of modulus strictly less 
than Q. The unique principal character modulo Q is denoted by yo. The character 
Xo mod | is the only one principal and primitive character. For T > 0, let N(T, x) 
denote the number of the nontrivial zeros with 0 < y, < T. For the primitive 
character y mod Q, we have (Montgomery and Vaughan [17, Corollary 14.7]) 


T T 
— log a + O(log QT), 
20 e 


NT, x) = = 


where T > 4. 

For special values of a and A the Lerch zeta-function reduces to the Riemann 
zeta-function L(1,1,s) = ¢(s), LU,1/2,5) = (2% — 1)e(s), LU/2,1,5) = 
(1—2!~)€(s), the Dirichlet L-function L(1/2, 1/2,s) = 2°L(s, y), where yy mod 4 
is an odd Dirichlet character. If we fix A = 1 we get the Hurwitz zeta-function 
L(i,a,s) = ¢(s,a@) and if we fix @ = 1 we get the periodic zeta-function 
e(A)L(A, 1,8) = F(s,A). Nontrivial zeros p(A,a) = B(A,a) + iy(A,a@) of the 
Lerch zeta-function L(A, a, s) are located in the strip —1 <a < 1+ a ([9]). In view 
of the formula [8, Corollary 2] 


+ O(log T), 


1 T a 
BO.a) — 5) = tog 5 
oer ( 2) 2m VAC {4}) 


we see that the nontrivial zeros are not always symmetrically distributed with respect 
to the critical line. It is pleasant to recall that the paper [8] was written when the first 
author was visiting W. Schwarz at Frankfurt university. 

Let 


logp if x = p* for some prime p and integer k > 1, 
A(x) = 


0 otherwise 
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be the von Mangoldt function. For an integer n and a character y mod Q the Gauss 
sum is defined by 


Q 
G(n, x) = x(a) exp (2x) : 


a=1 


If (n,Q) = 1, then for a primitive character y mod Q, we have |G(n, y)| = /O 
and, for the principle character yo mod Q, it is known that G(n, ¥o) = ju(n), where 
[L(n) is the MGbius function. We shall prove the following result: 


Theorem 1.1 Let y mod Q, Q > 1, be a primitive Dirichlet character. 
We have, for0 <A = & < 1, (kg) = 1, and0 <a <1, 


> L(..«.p,) =~ (4(=) x(=) +000 qe(— wi) DL — 0.2, ») = 


O<yyST 
+0) (7 exp(—clog*~ T)) (1.1) 


where 6(Q,q) = | if Olg, 6(Q,q) = 0 otherwise, and c is a positive absolute 
constant. 
Further, for0 <2’ = & <LkKqg=1, 


2 


F2 — 500, eta h Ot 


T 
Bay on log 5, t C5 


de LA, 1, py) =x log 


O<yyST 
+0 ee T)) (1.2) 


where the constant C(x, A) is defined by Eq. (2.17) below. 
Let A be a positive constant. Both asymptotic formulas of this theorem are valid 
uniformly for Q < log’ T. 


Next we discuss the asymptotic formula (1.1) of Theorem 1.1. From the proof of 
Theorem 1.2 in Nakamura [18] we know that, for0 < A < landO <a <1, 


L(A,a,1) £0. 


The proof of the last formula is nice and short. By the integral representation (Lerch 
[16, formula (2)] or [15, formula (2.6)]) we have 
[ove} x71 el —a)x ove) x01 eI) x (or = e727) 


I'(o)LQ,a,0) =f 
0 


SSS aX. 
ex — e2tid 


Now Re L(A, @, 1) > 0 since Re (e'-®*(e* — e?*)) > 0 for x > 0. 
Formula (1.1) extends results obtained in Fujii [4], Steuding [10, 21], where the 
case A = 1,0 <a@ < 1, and Q = 1 was investigated. In [6] and [14] Formula (1.2) 
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with 0 < A = k/q < 1, (k,q) = 1,@ = 1, and Q = 1 was considered, see also 
Steuding [22]. 

In the paper [11] the sum of L(s, x) and in [7] the sum of L(s, y)L(. — s, x) over 
nontrivial zeros of another Dirichlet L-function were studied. 

The next section is devoted to the proof of Theorem 1.1. 


2 Proof of Theorem 1.1 


We will use contour integration. The proof of the theorem relies on the method of 
Conrey et al. [2]. We divide the proof into the following Sects.: 2.1. Beginning of 
the proof: Functional equations; Sect. 2.2. Gonek’s lemma; and Sect. 2.3. Perron’s 
formula. 


2.1 Beginning of the Proof: Functional Equations 


Similarly as in [11], without loss of generality, we consider T satisfying the 
inequality 


1 
min |T — y,| > ———. (2.1) 
Vx | v7 log(QT) 


For q = 1, y mod q and t > 0 we have (Prachar [20, Theorem 3.3 in Chap. 7]) 
Nyt + 1) — Ny = Hoy = By + ivy 2 t< yy St+ 1} « logQ@+ 2). 
Thus 


S 1 eb | —(s, x)L(A, a, s)ds + O(log Q), 


O<yyST 


where the contour € is a rectangle with vertices a + ib, a+ iT, 1 —a-+ iT, and 
1 —a + ib, where a = 1 + 1/log(QT), 2 < b < 3, min,, |b— yy| > 1/log@. 
Further 


1 


L’ 
Ani on LOA, a, s)ds 


1—a+iT at+ib a+ib L’ 
all +f fo +f —(s, x)L(A, a, s)ds 
at+ib at+iT 1l—a+iT 1—a+ib L 
4 


=>. a 


j=l 
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First we obtain 


| at+iT L' 
NA -— | ra YL, a, s)ds 


20i a+ib 


vil 
On 


x(m)A(m)e(An) 1 7 1 
=- yy ene) (min +)" val (n+ aye 


m=2 n=0 


=—A (=) x (=) = + O(log”(QT)). 
a a) 2n 


Second we consider 72. Formula (6) from [11] gives [recall that T satisfies (2.1)] 


T L’ 
[ Te + it, YL, a,a + it)dt 


EL 
TAG + iT, x) <log’*(OT) for —1l<o<2,T>2. C2) 


By Corollary 4 from [5] we have 


1 1 
W664i <T eet for =——=6<414+—— (3) 
log(QT) log(QT) 
Thus we get 
l l-atiT 71 
A= — —(s, y)L(A,a,s)ds = O (73 log*(QT)) ; 
20i a+iT L 


Similarly we see that 


= O(log? Q). 
A change of variables s + 1 —%5 gives 
1 at+iT L' 
A=-— —(1-5, y)LO(,a, 1 —5)ds. 
2m at+ib L 
After conjugation we get 
ae 1 at+iT 7! 
A= —(1-s,7)LO,a,1—5)ds. 


20 at+ib L 
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To evaluate this integral we will use functional equations. A Dirichlet L-function to a 
primitive character x mod Q satisfies the functional equation (Apostol [1, Theorem 
12.11]) 

Li-s,z) = Ad —s, y)L(s, x), (2.4) 


where 


AQ -s,x) = Wi (=) I'(s) («x (-=) + w(—l) exp (=)) . 


Taking the logarithmic derivative of the functional equation we get 


L’ A! L’ 
A —S,X)= a —S,X)- aoe 
where 
A’ A’ tO 1 
—(s,¥) = —(1—-s,7) = -—log—+0(-), 2.5 
7A 0 A‘ 5,%) oe (-) (2.5) 


for t > 1. The Lerch zeta-function satisfies the functional equation (Lerch [16] 
or [15]) 


L(A, a,1—s) =Qx)“T(s) (« (5 = wh) LU —a,A,s) (2.6) 
Seg (-; tai {A})) L(a,1—{A}, »). 
Therefore 
aa —5s,%)L(,a,1—5) 
= (So. y- Zs, ) (22)-*T'(s) (« (-; + a2) L(a, 2,5) 
ag (5 st {A})) Mise Os »). 
Then 
= 1 atiT A’ 3 5 
Ts = — e(wh)=— / cn ADAM Te (-=) L(a,A,s)ds 


atiT A’ 
~e(-a(1 — ws [, TODA Pe (=) L(1—a,1— {2}, s)ds 
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1 a+iT L’ 
+ (ad) =— [., (Qn) T(s)e (-z) (8, DL. A. 8)ds 
1 atiT L’ 
+ e(-a(1 — A) / ag OD Te (5) FO. LA — 4,1 — {2},s)ds 
4 
= ee 
j=l 


say. In view of the bound (2.5), Stirling’s formula, and by the bound [5, Corollary 2] 
L(A, a,a + it) < logt, (2.7) 
we have 
Fo,F4 1. 


Summarizing the so far obtained results we see that, forO <A <1,0<a <1, 
and Q > 1, 


| LA,a, py) =—A (=) x (=) are ee Cae), (2.8) 


a) 2n 
O0<yy<T 


2.2. Gonek’s Lemma 
For F; and F3 we will use the following version of Gonek’s Lemma (c.f. Gonek 
{12, Lemma 5] and [3, Lemma 2]). 


Lemma 2.1 Assume that >°,<,|@m| « x and b, < 1. Let1 <c < 1+ 1/logt 
and 0 <6 < 1, then 


1 ctit oo a oo b 
= AQ — bali —" _g 
2ni Seti pe m* d (n+ a@)° : 
= Yo anbye(—m(n + @)) + 0 (e2(¢-1)*) 
nee 


uniformly in a € [5, 1]. Here 
A(1 —s) = 2(2)~I'(s) cos > 


is a factor from the functional equation ¢(1 — s) = AC —s)é(s). 
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Proof This is Lemma 5 in [10]. 


Let 6 = +1. In order to apply Lemma 2.1 for our purpose note that 


e(87) _ { O(exp(-ml1))) if r= 0. Os) 
2cos 1 + O (exp (—z|¢|)) otherwise. j 


We return to the integral F; and rewrite it as follows: 


T al at+it 
-e(ar | “(ati oa (22) (s)e (-3) H(@,4,s\as). 


In view of (2.9) and the bound from (2.7) Lemma 2.1 gives 


Fi = 


1 a+it ue i 
a i 7 (21)T(sye (-3) L(a, A, s)ds 


= > e(an)e(—(n + A)) + O(t?**) 


if 
O<ns 5-1 


= e(-A) > e(an) + O(r2+*) = O(r2t*). 


O<n<-A 
IT 


Then using Expression (2.5) we have, fora # 1, 


T Al 
J, = -e(ar | (a + it,y)d (o?*9) = O(T'/2+*) 
b 


and, fora = 1, 


T T 
F; = — log aes or), 
20 21e 


Next we consider the integral 73. Again by formula (2.9) and Lemma 2.1 we get 


Fy=—e(ad) S>  Alm)x(me(an)e(—m(n + d)) + O(7"7*5). 
hice a 


We split the last sum into two sums with (m,q) = 1 and (m,q) > 1 accordingly. 
For the case (m, q) > 1 we obtain, fora # 1, 


Y> Am) x(m)e(an)e(—md) = O| Y~ A(m) | = OlogT) 
ae 1 


m= £ 


IT 
(m,q)>1 
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and, fora = 1, 


Y> Alm) x(m)e(an)e(—md) =5 y> Alm) z(me(—ma)— + O(log T) 


m>1,n>0,(m,q)>1 


m(n--a)< Hea 
_T Diep LPP) | ooo 7 
= ED ose PP 5 O08. 
pla j=l : 


Now we use that A = = If (m, q) = 1, then the orthogonality relation for Dirichlet 
characters gives 


q 
Atmx(me (-m=) =7— YY Hae (-«-) A(m) xm) (on) 


w mod g a=1 


1 — 
70) Yo G-k WAM) (myn), 


w mod q 
where G(—k, VW) is the Gauss sum defined before Theorem 1.1. Then, for0 < a < 1, 


- e(ad) 


er) 


Dd GOK) DT Alm) x(m)y(me(an) (2.10) 
w mod q m>1,n>0 
(nA) <7 


~ (a ~ taped) a logp = ee + o(r/2te), 


plq j=l 


Again, we summarize obtained results. In view of formula (2.8) Sect. 2.2 gives, for 
0<A<1,0<a<l,andQ>1, 


dE LA.a, py) = (@- (a) 5— log = — A (=) x (=) | 454 ors). 


O0<yyST 2x 
(2.11) 


2.3. Perron’s Formula 


In Titchmarsh [23, Lemma 3.12] Perron’s formula for ordinary Dirichlet series 
is proved. Easy to check that the analogous lemma is true for Dirichlet series 
L(s, xw)L(a, 4,5). Note that Perron [19] proved his formula (without error term) 
for general Dirichlet series. Therefore, for the second sum of the formula (2.10), 


150 R. GarunkStis and J. Kalpokas 


Perron’s formula gives 


Y Aemxempypometen) =~ = / oe E (6, xW)La. 2.9) (z) . 


Wl Jq-i IU 


40 T log? T 
7 . 


In [20, Chap. 8, Theorem 6.2] considering [q,Q] « log’ 7, with B being any 
positive constant, we find 


m>1,n>0 


m(n-+A)< xo 


Cc 


Lis,xv) #40 for o > 1—-—,—, 
log t€ T 


where c is an absolute positive constant. With regard to this zero-free region for 
L(s, xv) let by = 1 -—c/ log’st9 T. Shifting the line of integration we get 


L' T\'1 

D2 AG) xlmyy mean) = = Resa $66, 20)L,2,9) (5) : 

m>1,n>0 L Qn S 
m(n-+a)< 


1 bi +iU by-iU a—iU L’ T 5 ds 
+ — ff + +/ —(s, xW)L(a, 4,5) (=) _ (2.12) 
2ni (Jativ bi +iU b,—-iu ) L 2a) s 


+40 T log? T 
U 


L' T\*1 
=: —Res,=1 —(s, yW)L(a, A, 5) (=) —-+kKi+Ko+K3t+ o( 
L 2a} s 


T log? T 
7 ‘ 
Recall that the character y is primitive. Thus if yy is a principal character mod 
[q, Q], then Q|g and 
v = Xo. 


where Wo is a principle character modgq. In view of this we introduce the notation 


1 if Qlq, 
0 otherwise. 


8(Q,q) = (2.13) 


The Dirichlet L-function attached to the principal character has a simple pole at 
s = 1, otherwise it is an entire function. In view of L(s, Yo) = C(s) le —p°*) 
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we have 


Zs, Yo) = (s > 1), 


pla P 


here y is the Euler—Mascheroni constant. For 0 < a < 1, the function L(a, 4,5) is 
entire and, for a = 1, the function L(@, 4,5) = €(s,A) has a simple pole at s = 1. 
It is known (Ivié [13, formula (1.122)]) that 


f(s,A) = —— +7) + 06-1) (s > 1), 
where 
a | r'(a) 
yA) = iim (oy — log(N + ») == PF) 


The last equality can be found in Wilton [24]. Note that y(1) is the Euler— 
Mascheroni constant. Thus, fora 4 1, 


— Res;=1 Zs, Wo)L(a, 2 o(F -) -= = L(a,d, D5 (2.14) 


and, fora = 1, 


L' T\*1 T T r’(a) logp \ T 

Re —in (tte) | — |) = — ee — = | = 

Bai 7 i volte »(=) 5 Qn Ine (74 ao =i | ox 
P\Q 


P 
(2.15) 
If yy is a nonprincipal character mod[q, Q], then for a = 1, 
L’ T\*1 L’ T 
— Res;=1 —(s, L(a,d, s—) -=-7(., —: 2.16 
esa Ts 4WL@.A,8)(-) 2 = Sa 02.16) 


Integrals 1, 2, and K3 are analogous to the integrals which appear in Formula 
(9) of [11]. By a similar reasoning as in [11], using bounds (2.2), (2.3), and choosing 
U = T!|, we get that 


Ki <« T eA log AFET 


for j = 1,2,3. In view of formulas (2.10)-(2.13) the residue (2.14) leads to 
Expression (1.1) of Theorem 1.1. Finally, residues (2.15) and (2.16) give (1.2) with 
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the constant 


C(x,4) = -6(Q, g)e(-A) 


Hq) { F'A) logp 
o@ | Fay t?tX5- 


Plq 


Y Gk Wl. 2.17) 


yw mod g 
xW#VO 


© F(pi in, 
= Sige 


piq j=l P 


e(—A) 
$(q) 


This proves Theorem 1.1. 
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Schur—Weyl Dualities Old and New 
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Abstract We give an overview of Schur—Wey] dualities involved in the representa- 
tion theory of orthogonal and symplectic groups and of the properties of a new class 
of algebras occurring in this context, the Brauer Schur algebras. 


Keywords Brauer Schur algebras * Schur—Wey] duality 
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1 A Problem in Classical Invariant Theory 


Let k be an algebraically closed field of any characteristic and G a general linear, 
orthogonal or symplectic group over k. A typical setup of classical invariant theory 
is to let the group G act on its natural representation E. Then it also acts on tensor 
powers of this natural representation F, on (tensor powers of) exterior powers and 
on (tensor powers of) symmetric powers of E. 

Fundamental problem: What are the symmetries of such representations, that is, 
the endomorphism rings Endg(M), where M is one of these representations? 

When addressing this problem, one tries to find characteristic free properties 
of the endomorphism rings, whose precise structure does in general depend on the 
characteristic of k. Moreover, one is hoping to find double centraliser properties: 
A bimodule 4Mgz over algebras A and B is said to satisfy the double centraliser 
property if 


B ~ Endy(M) andA ~ Endg(M). 
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Assuming that M is a left A-module for some algebra A, then M is also a right 
B := Enda(M)-module via m- gy := g(m) for any m € M and g@ é B, and 
the two actions on M commute. Without loss of generality we may assume that 
A acts faithfully on M, that is if am = 0 for all m € M, then the element a € A 
must be zero. We then always have A C Endg(M) where a € A is mapped to the 
homomorphism multiplication with a, however, normally one cannot expect to have 
a set-equality, that is a double centraliser property. Generally, set-equality can only 
occur in case A (and B) acts faithfully on M. Therefore, when looking for double 
centraliser properties in our context, one has to define A as the largest quotient of 
the group algebra kG acting faithfully on the representation M. Since we choose 
M to be a finite dimensional vector space, the algebra A in a double centraliser 
property is a finite dimensional associative algebra. The given representation of G 
on M sends group elements to matrices acting k-linearly on M. The algebra A then 
is linearly generated by these matrices. The algebra A therefore is often called the 
enveloping algebra of the G-action on M. Sometimes, we will denote it by Seny(G), 
when M is fixed. 

When A happens to be a semisimple algebra, then M (finitely generated) by 
definition is a semisimple A-module. It affords a double centraliser property if and 
only if it is a faithful A-module. In this case, by Morita’s theorem, there is an 
equivalence of categories between A-modules and B-modules, that is the algebras 
A and B have the same representation theory. The algebras we are interested in 
are in general not semisimple. In this case, one cannot expect an equivalence of 
categories: In fact, in the examples discussed below, it may and will happen that A 
has finite global (cohomological) dimension—so, all extension groups between A- 
modules vanish in large degrees—while B has infinite global dimension—so, there 
are non-zero extension groups between B-modules in arbitrarily large degree. The 
double centraliser property does, however, still provide useful functors relating the 
representation theories of the algebras A and B. 

The best known case of a double centraliser property is that of tensor powers 
E®" of the natural representation E, typically referred to as (versions of) Schur- 
Weyl! duality. The enveloping algebra in this context is called Schur algebra. We 
will now state and explain Schur—Wey] duality for the groups considered here. 


1.1 Tensor Powers 
1.1.1 Classical Schur—Wey] Duality 


The n-dimensional k-vector space E is a module for the group G = GL, (k), where 
G acts on E by matrix multiplication. For any natural number r, the group G then 
acts on the r-fold tensor space E®” by diagonal extension of this action: 


g- (x1 @-+-@X,) = Bx, @... Bx; 


for vectors x; € E. The symmetric group »’, acts by place permutation on the r-fold 
tensor space. The two actions commute and do, in fact, centralise each other. When 
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n> r, the symmetric group acts faithfully with 
Endyg(E®’) = kX, and Endy, (E®”) = Senv(G), 


where the enveloping algebra Szn»(G) of G in End,(E®") is isomorphic to Green’s 
classical (type A) Schur algebra S(n,r). The relevance of Schur algebras is that 
the category of polynomial G-modules, which are homogenous of degree r, is 
equivalent to the category of S(n,r)-modules. When n < r the above action is no 
longer faithful and hence the group algebra k%’, has to be replaced by a certain 
known quotient algebra. In characteristic zero, classical Schur—Wey] duality is due 
to Schur [38], in general it follows from results of Carter and Lusztig [4], De Concini 
and Procesi [7] and Green [22, Theorem 2.6c], and [31], using the connection 
between double centraliser properties and dominant dimension. 


1.1.2 Orthogonal and Symplectic Schur—Wey] Dualities 


When we replace the general linear group by an orthogonal or symplectic subgroup, 
the endomorphism ring of tensor space must increase. 


The Brauer Algebra Before describing the endomorphism ring of tensor space 
in this situation, recall a diagrammatic description of symmetric group elements: 
Elements of the symmetric group 2 on r letters can be visualised by diagrams. 
The following picture shows two ways to write a permutation: 


T= be 38 corresponds to e e e e 
2341 po 


The product of two permutations can be written as concatenation of two diagrams 
where the vertices of the bottom row of the first diagram are connected vertically 
with the vertices of the top row of the second diagram: 


(30)) VS 
mater) SS [| 
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In orthogonal and symplectic Schur—Weyl] duality, Brauer algebras take the 
role played by the group algebra of the symmetric group in classical Schur—Weyl 
duality, described in Sect. 1.1.1. Let k be a field (or any commutative domain), 
and choose a parameter 6 € k. Let r be a natural number. The Brauer algebra 
B,(5) of degree r and parameter 6 is defined to be the vector space with k-basis 
given by the set of all Brauer diagrams on 2r vertices. A Brauer diagram is a 
diagram whose vertices are arranged in two rows of r vertices each, and there are r 
edges between the vertices such that each vertex is incident to precisely one edge. 
Brauer diagrams are considered up to homotopy, thus the dimension of B,(6) is 
(2r—1)!! = Qr—1)-(2r—3)---3-1. To multiply two Brauer diagrams, say b; and 
bo, the diagrams are concatenated as described for symmetric group elements, with 
b, drawn on top of bz, and any closed loops appearing are removed, to give a Brauer 
diagram d. The result of the multiplication then is, by definition, b}-b2 = 6°d, where 
c is the number of closed loops removed. Here is an example, the product of two 
elements in B4(6): 


Typically the parameter 6 is understood from the context, and we will denote the 
Brauer algebra by B, or just B. Brauer algebras were introduced in [2] in the context 
of generalising Schur—Wey] duality from general linear groups to orthogonal and 
symplectic subgroups. For more details and examples see, for instance, [2, 3, 23-25, 
29, 30, 32]. The restriction of the parameter 6 = +n is necessary to obtain an action 
of the Brauer algebra B,(6) on the r-fold tensor space and on the generalised exterior 
powers \*E as well as on the generalised symmetric powers Sym*E to be studied 
below. In characteristic zero, Brauer algebras are semisimple for non-integral 
parameter and for large integral parameter 6, by a famous result of Wenz] [41]. 


Orthogonal and Symplectic Groups As before, k is an algebraically closed field of 
characteristic p > O and let n,r be positive integers. Let E be an n-dimensional 
k-vector space and let w be a non-degenerate symmetric bilinear form on E. The 
orthogonal group relative to w is 


O, = {g € GL, | w(gx, gy) = w(x, y) forall x,y € FE}. 


Similarly for n = 2m even positive integer, let w be a non-degenerate skew- 
symmetric bilinear form on E. The symplectic group relative to w is 


Sp, = {g € GL, | (gx, gy) = w(x, y) for all x,y € E }. 
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All matrices here are defined over the field k, that is GL, = GL,(k), and so on. 
The classical groups GL,, Sp,, and O, operate on E by matrix multiplication. This 
action extends diagonally to an action on the tensor space E®”, which is connected 
to Brauer algebras. 


The Connection with Brauer Algebras Let G € {Sp,,O,}. Brauer diagrams can 
be interpreted as G-homomorphisms in the following way: Assume EF has basis 
{e,,..., én}, and let {e!,..., e”} be the dual basis of E with respect to the invariant 
form w. Define 


Then ? is G-invariant (see [20, 4.3.2.]). For 1 < i,j < n, define the (i,j)th 
contraction operator C;; : E®” > E®"~? by 


Cij(X1 ® +++ @ Xp) = W(X, X/) X1 @ +++ OX; @-+- OX @--- Ox, 


with vectors x;’s in E and where we omit the ith vector x; and the jth vector x; in 
the tensor product. Moreover, the (i, j)th expansion operator D;; : E®’~? > E®" is 
defined by 


n 


Dj j(x1 ® +++ @ Xp—-2) = Yo xy B+ Be, B+ Bel B+ Bxp2. 


t=1 


Here e, is in the ith position and e’ is in the jth position. Setting bj = D,; 0 C;,;, it is 
easily checked that b;; = b,j. It turns out that all elements in Endg(E®’) coincide 
with elements in the Brauer algebra B,. When labeling the vertices in the top row as 
well as the bottom row from left to right by | up to r, the element b;; coincides with 
the Brauer diagram 


e eee e e e see e e e eee e 
by = are 
e eae | o; e eee e e | eee e 


with the horizontal edges between vertices i and j. When considering the action 
of Brauer algebras on tensor space, the top row horizontal arc corresponds to the 
contraction operator, and the arc in the bottom row corresponds to the expansion 
operator. 

Diagrams consisting of r — 2/ through strings connecting top and bottom vertices 
(and / arcs at corresponding top and bottom places) naturally correspond to elements 
of a symmetric group »,2;. Such elements are G-endomorphisms of tensor space 
factoring through the smaller tensor space E®’~?!, Every Brauer diagram can be 
factorised as a product of contraction operators, an element of a symmetric group 
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»/,-23 and then a product of expansion operators. This factorisation is the basic 
ingredient of the cellular structure of the Brauer algebra, for details see [30]. 

From now on, we assume n > 2r in case G is a symplectic group and n > 2r in 
the orthogonal case. Then the Brauer algebra with parameter -En acts faithfully on 
tensor space E®”, Results by Brauer [2] in characteristic zero and in general by De 
Concini—Procesi [7], Oehms [35], Dipper-Doty—Hu [8, 17] and Tange [40] extend 
classical Schur—Weyl] duality to orthogonal and symplectic subgroups, implying in 
particular the following two isomorphisms: 


B,(n) = Endo, (E®’), B,(—n) = Ends,, (E®’). (1.1) 
Senv(On) = Endg, in (E®’), Senv(Sp,) = Endg,(—-m (E®’). (1.2) 


Recall that here S.,,(G) denotes the enveloping algebra in End,(E®’) of the 
respective group (see [16], [33] and [34] for more information on these algebras). 
A version of Schur—Weyl duality involving Homg(E®’, E®') with s and ¢ not 
necessarily equal can be found in [40]: In this version, tensor space E®” is replaced 
by a direct sum @’_)E®*. In [40, Theorem 3], Schur—Wey] duality is shown for this 
situation; the statement and the conditions in this situation coincide with those of 
usual Schur—Wey] duality. (This also works in the orthogonal case, see [40, Remark 
3].) Here, basis elements of G-homomorphisms between tensor spaces of different 
degrees are represented by generalised Brauer diagrams (called (u, v)-diagrams in 
[40]) with u vertices in the top row and v vertices in the bottom row. See [40, Sect. 3] 
for explanations and details. 


1.2. Exterior Powers 


Keep the group G as above, let E be its natural representation and j < dim,;(E). 
The exterior powers \/E of E are G-representations too, and so are tensor powers 
of these. For a natural number r, let A = (A,,...,4,) be a partition of r (written 
AF r), that is A; >--- >A, > Oandr = >“, A; is the sum of all the parts Aj. 
(The ordering condition on the parts 4; is not really important in our context. We 
may as well use compositions A, which are unordered partitions of r, denoted by 
A Er.) Then AE := AE @ +++ ® A*"E is a G-representation, where again g € G 
is acting diagonally on each entry of the tensor. 


1.2.1 The Classical Case of the General Linear Group 


Let n > rand G = GL,. Then @y+, A? E is a faithful representation of the 
classical Schur algebra S(n,r). Donkin [13], see also [14, 15], has shown that 
there is a double centraliser property relating S(n, 7) and the endomorphism ring 
Endg(@4+-+ Ah E). This endomorphism ring turns out to be Morita equivalent 
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to S(n,r) itself; when choosing suitable multiplicities of indecomposable direct 
summands, the two algebras even are isomorphic. This result, which has found 
plenty of applications in representation theory and invariant theory, relies on a deep 
structural property of these exterior powers, discovered by Donkin: These modules 
are tilting modules of S(n,r); more precisely, these are the characteristic tilting 
modules (in the sense of Ringel [36]) of the quasi-hereditary algebras S(n, r). (See 
Sect. 3 for more information on quasi-hereditary algebras.) 

Tilting theory in general is a far reaching generalisation of the theory of 
Morita equivalences. It provides equivalences not of module categories, but of 
derived module categories. Derived module categories are much coarser invariants 
than module categories, and thus there are many more derived equivalences than 
Morita equivalences. On any tilting module there is a double centraliser property. 
The functors providing the derived equivalences are constructed from particular 
modules, called tilting modules, which satisfy strong homological properties and 
which generalise the projective generators used in Morita theory. Building on work 
of Auslander and Reiten, Ringel constructed for any quasi-hereditary algebra— 
and Schur algebras are examples of quasi-hereditary algebras—a particular tilting 
module, namely the characteristic one. He showed that the endomorphism ring of 
such a characteristic tilting module again is quasi-hereditary. This endomorphism 
ring nowadays is called the Ringel dual of the original algebra. Donkin’s result then 
is that Schur algebras S(n,r) for n > r are Ringel self-dual. For n < r, the Ringel 
dual may be different, see, for example, [18], but Donkin showed that it is always a 
quasi-hereditary quotient of some Schur algebra. 


1.2.2 The Orthogonal and Symplectic Cases 


If G is orthogonal or symplectic, there are partial results by Adamowitch and 
Rybnikov [1] about characteristic tilting modules and Ringel duality, extending 
Donkin’s results. A full description is, however, still missing. 


1.3. Symmetric Powers 


The least known case in the orthogonal and symplectic case has been that of tensor 
powers of symmetric powers. 


1.3.1 The Classical Case of General Linear Groups 

The classical Schur algebra S(n, r) is dual to the coalgebra of polynomials in n? 
variables and of degree r, which is a subcoalgebra of the regular functions on the 
general linear groups, see [22]. Therefore, the generalised symmetric powers are 
injective modules over the classical Schur algebra, which is the endomorphism 
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ring of a direct sum of injective modules. Thus, nothing new is happening in this 
situation. This coincidence does, however, not carry over to the other two cases. 


1.3.2 The Orthogonal and Symplectic Cases 


Until recently, no endomorphism rings of generalised symmetric powers over 
symplectic or orthogonal groups had been calculated. When studying another, 
different kind of Schur algebras, which we call Schur algebras of Brauer algebras 
or just Brauer Schur algebras, denoted Sg(n, r), it surprisingly turned out that the 
previously unknown endomorphism algebras of generalised symmetric powers are 
examples of these. We state the result: 


Theorem 1.1 ((28]) Let G C GL, be an orthogonal or symplectic group, over an 
algebraically closed field k. Assume n = 2r when G is a symplectic group, and 
n > 2r when G is an orthogonal group. Denote by B, = B,(d) the Brauer algebra 
with parameter 6 = —n € k when G is a symplectic group, and 5 =n € k whenG 
is an orthogonal group. Then the centraliser algebra 


Endg( €) _— Sym’£) 
AFr—21,0SI< 5 


is isomorphic to the Schur algebra Sz(n, r) of the Brauer algebra B,. 


In the following sections, the original approach to Brauer Schur algebras Sz(n, r) 
will be explained and motivated, and their properties will be described. To conclude 
this section, we summarise the results and list the endomorphism rings End := 
Endg(M) when known where G € {GL,,O,,Sp,,} and M is the tensor space or 
generalised symmetric or exterior powers: 


Tensor space Exterior power Symmetric power 
General linear group, | Schur—Wey] duality Tilting theory Schur—Wey] duality 
n>r End = kJ, End = S(n,r) 
General linear group, | Schur—Wey] duality Tilting theory End = | Schur—Weyl] duality 
n<r End = quotient of quotient of a Schur 

kd, algebra ; 
Orthogonal/symplectic | Schur—Wey] duality ? End = Sp(n, r, En) 
group, n > 2r End = B,(=n) Brauer Schur algebra 
Orthogonal/symplectic | Schur—Wey] duality 2 ? 


group, n < 2r 
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2 The Other Side of Schur—Weyl! Duality 


2.1 The Case of the General Linear Groups and Symmetric 
Groups 


Schur—Weyl] duality relates representation theory of infinite general linear groups 
with representation theory of finite symmetric groups. The bimodule affording the 
double centraliser property, the tensor space, is a natural choice of a representation 
of the general linear group, being a tensor product of the natural representation. In 
order to explain the relevance of tensor space as a representation of the symmetric 
group, we first consider permutation representations of finite groups. 


2.1.1 Permutation Modules of Symmetric Groups 


The symmetric group 2 on r letters acts on a set X with r elements by permuting 
these elements. Choosing a vector space with basis X, one obtains a representation 
of »’. given by r x r-permutation matrices, whose entries are zero or one with 
exactly one entry one in each row and in each column. Such a module is called a 
permutation representation. Any finite group G is a subgroup of some symmetric 
group 2, and, by restriction of the action, the group G also has a permutation 
representation. More generally, many representations can be obtained by letting 
G act on a finite set. This action may not be transitive, that is, X may be a non- 
trivial disjoint union of many orbits, and it may not be faithful, that is, elements in 
X may have non-trivial stabilisers. An algebraic construction producing transitive 
permutation modules goes as follows: Choose a subgroup H C G and a field k. 
View k as the trivial representation of H, with all h € A acting as multiplication 
by one. Induce up this representation from H to G by forming the tensor product 
ind\Gk = k @xu kG. The resulting vector space is a representation of G, with the 
group G acting by right multiplication on the vector space ind(Gk. Its basis is given 
by the set of cosets X = H\G, and the stabiliser subgroup of the coset H is H itself. 
Using H the trivial group, we find the group algebra kG as an example of such a 
permutation module. 

For the symmetric group G = ,, a special class of subgroups is especially 
interesting, the Young subgroups: Fix a partition A = (Aj,...,An) of r (or a 
composition) and define the Young subgroup H = 2), x --- x Ly, of G, which 
is a product of smaller symmetric groups acting on the sets X; = {1,...,A;}, 
X = {A, +1,...,A1 + Az}, and so on. The induced module ind\Gk =k gs, kd, 
then is denoted by M*, with a basis of M’ given by the set of cosets 54,\D,. These 
are the permutation modules of interest in our context. 

In fact, taking e;,...,e, as basis of E, the r-fold tensor space of E has a basis 
consisting of all possible tensors ¢;, ® --- @ e;, with e;, in the given basis of E, for 
1 <j <r. The action of »’. permutes these tensors, which means that X’', acts on 
the set X of tensors. What are the transitive components, that is the direct summands 
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of the corresponding representation? Write i = (i),...,i,). Let A, be the number 
of 1s occuring as indices i; in e; := e;, ® +++ ® e;,, let Az be the number of 2s, and 
so on. Then A(i) := A = (Ay,..., An) is a composition of r, called the composition 
associated with the multi-index i = (i,,...,i,-). Permuting the entries of e; does 
not change the indices occuring in e;, and every permutation is possible. Thus the 
stabiliser of e;, ® --- @ e;, is the Young subgroup X. Therefore, the ¥’,-orbit of e; 
corresponds to a representation that is isomorphic to the permutation representation 
M*. To summarise, as a representation of 2, the r-fold tensor space is isomorphic 
to a direct sum of permutation representations M*, with some multiplicities. When 
n> r, all partitions A of r do occur in this decomposition of the tensor space. 


2.1.2 Specht Filtered Modules 


Permutation modules are not only combinatorially nice to handle but they also have 
an important structural property in the category of %’-modules. To explain this 
property, we need to look at the ring structure of the group algebra k»,. While 
the Schur algebra S(n,r) introduced in Sect. 1.1.1 is a quasi-hereditary algebra, 
the group algebra k»’, is not, unless it is semisimple. But it can be written as 
kS = eS(r, r)e for some idempotent e = e? in the Schur algebra S(r, r), that is with 
both parameters equal to r. As a consequence, k’, is a cellular algebra, as defined 
by Graham and Lehrer [21]. The analogue of Weyl modules, that is of the standard 
modules of the quasi-hereditary structure, of general linear groups are the Specht 
modules of symmetric groups. The Specht modules S* are indexed by partitions 
of r. In characteristic zero, the Specht modules are exactly the simple modules, up 
to isomorphism. In prime characteristic, they are in general not (semi)-simple, but 
all simple modules occur as quotients of Specht modules. Specht modules can be 
constructed in a combinatorial way, and as such they exist in each characteristic. 
The Schur functor e : S(r,r) — mod > kX’, — mod,M +> M -e, that is given by 
right multiplication by a certain idempotent e, sends the Weyl modules precisely 
to the Specht modules (which in the literature sometimes are called dual Specht 
modules). Let us denote by F (Weyl) the category of S(r,r)-modules having a 
Wey] filtration, that is, a finite chain of submodules with subquotients being Wey] 
modules; similarly, we denote by F (Specht) the category of kd’.-modules having 
a Specht filtration. The Schur functor restricts to a functor F(Weyl) — F (Specht), 
or in the language of quasi-hereditary algebras and cellular algebras to a functor 
F (standard) — F (cell). In characteristic zero (or bigger than r), F (Weyl) is the 
category of all S(r,r)-modules, F (Specht) is the category of all kS’,-modules and 
the Schur functor is an equivalence between these two categories. This equivalence 
is, in modern terms, the main result of Schur’s thesis, who showed in 1901 that in 
characteristic zero, the representation theories of the general linear and of the finite 
symmetric group are controlled by the same combinatorics. In 1927, he rederived 
this equivalence of categories in terms of Schur—Weyl] duality, which in his case is 
a relation between two semisimple algebras (both acting faithfully on tensor space, 
when n > r), as discussed in the previous section. 
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In prime characteristic 2 < char(k) < r, the filtered categories F (Weyl) and 
F (Specht) are proper subcategories of the module categories S(r, r)—mod and S'.— 
mod, respectively, which—as remarked above—cannot be equivalent categories. An 
influential result by Hemmer and Nakano [26] does, however, imply that the filtered 
categories are equivalent in this situation, as exact categories—except in the case of 
characteristic two or three; in characteristic three, there is no equivalence of exact 
categories, and in characteristic two, there is no equivalence at all—this can already 
be seen by trying the smallest possible example, k2'2 with k of characteristic two. 


2.1.3 Projective Objects in Filtered Categories 


The projective modules of the Schur algebra are objects in the filtered category 
F (Weyl), and they are, of course, projective in this category as well. Thus, their 
images under the Hemmer—Nakano equivalence must be a full set of projective 
objects in the filtered category F (Specht). The projective kX'-modules do have 
Specht filtrations (this is one ingredient in the cellular structure of k/). So, they 
are projective objects in the category F (Specht). But they do not form a full set 
of projective objects there. It turns out that the full set of projective objects in 
F (Specht) are provided by the permutation modules M*, which are in fact the 
images of the projective S(r, r)-modules under the Schur functor; moreover, eS(r, r) 
is just tensor space. This is their structural property announced above. In particular, 
a structural reason for tensor space occurring in our context is that it is a full 
sum of indecomposable projective objects in ¥ (Specht). Here full means that each 
isomorphism type occurs at least once. Such projective objects in subcategories 
often are called relative projective modules. 


2.2 The Case of the Orthogonal and Symplectic Groups 
and Brauer Algebras 


When trying to find in this case analogies to the objects and their relations 
described above, there are some surprises. As before, Schur—Wey] duality relates 
representation theory of symplectic and orthogonal subgroups of the general linear 
groups with representation theory of Brauer algebras. As before, the Brauer algebra 
is acellular algebra. Thus the category F (Specht) still exists, where the term Specht 
module refers to the cell modules of the cellular structure of the Brauer algebra (see 
[21]). However, tensor space is, in general, not Specht filtered (see [29]); thus it 
has been a problem to find ‘permutation modules’ that are relative projective in the 
category of Specht filtered modules. Moreover, an analogue of the Hemmer—Nakano 
theorem had to be found. 
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2.2.1 Permutation Modules of Brauer Algebras 


Is there an analogue of permutation modules, and are these the projective objects 
in F (Specht)? This last question has been answered by Hartmann and Paget [24]: 
here again the theory works in full analogy to the cellular structure of the group 
algebra of the symmetric group, and works as follows: For / < 5 and A F r—2I, the 
permutation module M(/, 4) is defined as 


M(1,A) = M? @ks,_», By 


where 


Oe as 2 oe ee a (2.1) 


with / arcs in the top and bottom row of the diagram, respectively, and M’ is the 
permutation module (indexed by 4) associated with the symmetric group 57,2). It 
should be noted that both the basis vectors of tensor space and the idempotents of 
the Brauer algebra are denoted in the literature by the symbol e;; the meaning will 
be clear from the context. 

Hartmann and Paget showed by combinatorial methods that these permutation 
modules M(/, A) in fact are the projective objects in the filtered category F (Specht). 
A more structural and much more general approach has been developed in [25], 
based on the concept of cellularly stratified algebras introduced there. 


2.2.2 Two Different Types of Schur Algebras and Their Schur—Weyl 
Dualities 


In type A, the classical situation, tensor space is Weyl filtered when seen over 
the general linear group; and tensor space is Specht filtered and a direct sum of 
permutation modules M* when seen over the group algebra of the symmetric group. 
By Sect. 2.1.1, the classical Schur algebra 


Endjy,(E®’) = S(n,r) = Ends, (CB M’*) 
AEr 


is the endomorphism ring of tensor space, but equally well, the endomorphism ring 
of a full sum of permutation modules. For orthogonal and symplectic groups G, 
tensor space is Wey] filtered over the orthogonal and symplectic groups. But here 
is now a fundamental difference: Tensor space is in general not Specht filtered and 
hence not a direct sum of permutation modules over the Brauer algebra, for details, 
see [29]. We therefore have two different algebras, taking the place of the classical 
Schur algebra: the first algebra is Donkin’s [12] generalised Schur algebra, also 
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referred to as symplectic or orthogonal Schur algebra: 
Seny (G) = Endg, (E®’). 


The second algebra, the Brauer Schur algebra, is defined to be the endomorphism 
ring of a full (meaning each isomorphism type occurs at least once) direct sum of all 
the permutation modules M(J, A), possibly with appropriately chosen multiplicities: 


Sp(n,r,8) = Sp(n,r) =Endg,( €)  MU.A)). 


Arr—21,0SI< 5 


The Brauer Schur algebra is, in general, not isomorphic to the Schur algebra of 
the orthogonal or symplectic group, and it is defined much more generally. One 
of its relevances is that the Brauer Schur algebra is the endomorphism ring of the 
direct sum of tensor powers of symmetric powers, see Theorem 1.1, with appropriate 
parameter 6. 

Together with these two different types of Schur algebras come two different 
Schur—Weyl] dualities for the Brauer algebra: There is a Schur—Wey] duality on 
tensor space, relating Donkin’s generalised Schur algebra with the Brauer algebra, 
as described in Sect. 1.1.2. However, there is also a Schur—Weyl] duality (observed 
abstractly in [25] and combinatorially in [27]) on the full direct sum of the 
permutation modules M(/,A), relating the Brauer Schur algebra with the Brauer 
algebra. More of this will be described in the next section. 


3 Properties of Brauer Schur Algebras 


In the final section of the article we summarise the ring theoretical and homological 
properties of Brauer Schur algebras. To do so, we first have another look at the 
classical Schur algebra, which has been defined as 


S(n,r) = Endy, (E®’) = Endgs, (@M*) 


where the sum runs over all compositions of r with at most n parts. 


3.1 Combinatorial Bases of Schur Algebras of Brauer Algebras 


In order to work with such an algebra, a more explicit definition is useful, given by 
a basis and a multiplication formula. Denote by /(n, r) the set of multi-indices of 
length r with entries in the set {1,...,”}. The symmetric group *’, operates on the 
set of multi-indices /(n, r) by permuting places, and hence the stabiliser of a multi- 
index is defined, Stab(i) = {t € S| it = i}. Recall the definition of a composition 
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A(i) associated with a given multi-index i, given in Sect. 2.1.1. One of the standardly 
used bases of the classical Schur algebra is indexed by certain pairs of multi-indices: 
Let k, 1 € I(n, r) with associated compositions A(k) = jz and A(/) = A, respectively. 
Define the 5,-homomorphism &, : M* — M¥ on the generator e; (or alternatively 
on the coset generator D) - id) of the Y.-module M* by 


& (2) id) = Ele) = Yo kalciaei- (3.1) 


i€l(n,r) 


Here the scalar & (c;;) = 0 unless there exists a permutation z € Stab(/) with 
km = iin which case we define &j(c;;) = 1. This definition is extended such 
that we have an equality of maps &; = &, if and only if there is a permutation 
xz € J, which simultaneously permutes i into k and j into J. The set of all such 
homomorphisms is an integral basis of the classical Schur algebra. This basis is 
defined in combinatorial terms only, and hence is independent of the ground field 
and its characteristics. There is a nice, but combinatorially involved, multiplication 
formula for two such basis vectors, expressing the product again as a linear 
combination in this basis. The structure constants here are given by the cardinality 
of certain sets, and hence are non-negative integers, and in particular, multiplication 
will depend on the characteristic of the underlying field, but the multiplication rule 
as such does not depend and the combinatorial description of the basis doesn’t 
either. This is a characteristic free property in the sense of the aims stated in the 
introduction. 
The precise multiplication rule is as follows (see [22, (2.3b)]): 


&ij . Ex = > Z(i, j, k, Lp, Dépq 


(p.9) 


where the sum is over a set of representatives (p, q) of orbits of the symmetric group 
»/, acting on /(n, r) x I(n, r). The number Z(i, j,k, 1, p, g)—to be seen as an element 
of the ground field—counts s € J(n,r) such that (i,j) and (p,s) are in the same 
»’,-orbit and also (k, /) and (s, g) are in the same orbit. Thus, the product is zero if j 
and k are not in the same ’,-orbit. 

We rewrite the homomorphisms given in (3.1). For a composition A, write 
ind(A) for a multi-index i whose entries are ordered increasingly with associated 
composition A. By the definition of the scalars in (3.1), without loss of generality, 
we may assume in the above setup that the entries of the multi-index / are ordered 
increasingly, that is, / = ind(A). There exists a permutation o with k = ind(w)o, 
and hence 


Stab(J) N Stab(k) = 24 No! Tyo. 
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The sum in (3.1) can thus be rewritten by 


(24+ id) = Ee i(e1) = > Cindy: 
ae(L,Na! Dy,o)\Zy 


We now look at the Brauer Schur algebra. For any fixed parameter 5 0 in the 
field k, the Brauer Schur algebra is 


Sp(n,r) =Endg,( EG) MUi,A)). 
Arr—21,0<I< 5 

In analogy to the above basis of the classical Schur algebra, an explicit integral basis 
of the Brauer Schur algebra Sz(n,r), indexed by certain double cosets, has been 
constructed in [27, Theorem 5.3]. Let A be a partition of r — 2/ and y a partition 
of r — 2m and let u > I,m. For a natural number s define the group H, = C2 X,. 
Choose representatives m and o of the double cosets ¥,,\ X—2m/(2—2u X Hu—m) 
and (X), x H,—))\X;—21/ =, respectively, where ©, = Y,_2, A 27! +,1. Define 
the B-homomorphism @¢,.7.¢ : M(/,A) — M(m, 4) on a generator by: 


$uxo(2a id ® e1) = a (ytd ® erru)o a. 
aE SyNo!(S\ XAy—)o\ Ey 


Theorem 3.1 ([27, Theorem 7.1]) The set of all such B-homomorphisms forms a 
basis of the Brauer Schur algebra Sp(n, r). 


Again this basis itself does depend on various combinatorial data, however, not on 
the ground field k and its characteristics, nor the parameter 6 of the Brauer algebra. 
So, the Brauer Schur algebra is defined integrally and hence over any ground ring by 
change of scalars. Products of two such basis vectors can be computed in analogy 
to the multiplication in the classical Schur algebra, in particular, structure constants 
are again non-negative integers. This basis is characteristic free in the same way as 
Green’s basis of the classical Schur algebra. 

The B-modules M(I,A) are by definition induced from the ¥,-modules M*. 
Therefore, inducing up a map M* -—> M# yields a map M(I,A) > M(i, 1)— 
note that both modules have the same index /. Checking the details of the above 
description one shows that in this way Green’s basis elements §; correspond to 
basis elements Qy,id¢,¢ : M(/,A) — M(I, 4) where A = A(/) andj has non-decreasing 
entries, 4 = A(i) and i is obtained from the index with non-decreasing entries by 
applying the permutation o. So, the above basis of Sg(n, r) is indeed an extension of 
Green’s basis of the classical Schur algebra S(n, r—2/). Checking the multiplication 
rules, it turns out that the classical Schur algebra S(n, r — 2/) is (for each /) a non- 
unital subalgebra of Sg(n, r). It follows that the (in general unknown) decomposition 
matrices of classical Schur algebras are diagonal submatrices of the decomposition 
matrix of the Brauer Schur algebra Sg(n, r). See Sect. 6 in [27] for details. 
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3.2. Quasi-Hereditary Structure 


Quasi-hereditary algebras were introduced by Scott [39] in the context of highest 
weight categories. They arise naturally in the representation theory of semisimple 
complex Lie algebras and algebraic groups. Cline, Parshall and Scott have shown 
that in general highest weight categories can be described in terms of quasi- 
hereditary algebras and their module categories, see [5, 6]. Many other algebras 
have been shown to be quasi-hereditary too, for instance, path algebras of quivers, 
and more generally algebras of global dimension less than or equal to two. 

There are two customary ways to define quasi-hereditary algebras. One is based 
on the existence of a chainO0 C J; C --- C J, = A of two-sided ideals, called 
heredity chain. These ideals are required to be generated by idempotent elements 
and to satisfy strong homological conditions, which, for instance, imply that the 
derived module category of the algebra A contains the derived categories of the 
quotient algebras A/J; (for each J) as full subcategories. In particular, cohomology 
between A/J;-modules coincides with cohomology of the same modules viewed as 
A-modules. 

The other definition is based on the existence of a collection of modules, called 
standard modules, A,,..., An. Homomorphisms and extensions between these 
modules are allowed only in increasing order; Hom(A;, Aj) # 0 implies i < j 
and Ext!(A;, A ;) # 0 implies i < j. Moreover, their endomorphism rings are skew- 
fields and their composition factors also satisfy an ordering condition. Such standard 
modules occur in nature as Weyl modules of algebraic groups, as Verma modules of 
semisimple complex Lie algebra or as exceptional coherent sheaves. By Dlab and 
Ringel’s ‘standardisation theorem’ [11], behind every set of standard objects (in 
an abelian category) there is a quasi-hereditary algebra. Therefore, quasi-hereditary 
algebras are being studied in many areas, from representation theory over Lie theory 
to algebraic geometry. 

Donkin [12] showed that generalised Schur algebras are quasi-hereditary alge- 
bras. What about Brauer Schur algebras? As explained above, analysing certain 
subsets 6; with 0 < 1 < r/2 of the above basis of the Brauer Schur algebra and 
their multiplication, one can show that the vector space spanned by 83; is isomorphic 
to a classical (type A) Schur algebras S(n, r — 21), mapping basis vectors in B; to 
the respective &; ;-basis. Classical Schur algebras are quasi-hereditary, by Donkin’s 
work [12]. Using that the classical Schur algebras S(n, r — 21) can be embedded 
diagonally into the Brauer Schur algebra as described, it follows together with a 
characterisation of quasi-hereditary algebras by Dlab and Ringel [10]: 


Theorem 3.2 ((27, Theorem 7.1]) The Brauer Schur algebra Sg(n,r) is a quasi- 
hereditary algebra. 


Thus the module category of the Brauer Schur algebra is a highest weight 
category in the sense of Cline, Parshall and Scott. In the proof, a heredity chain 
of the Brauer Schur algebra is constructed by inducing up the heredity chains of the 
diagonally embedded classical Schur algebras S(n, r — 2/). In particular, the simple 
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Sa(n, r)-modules are parametrised in the same way as the simple modules of all the 
classical Schur algebras S(n, r—2/) for 0 < I < r/2 together, that is by the partitions 
into at most n parts of r,r — 2,r—4,...,1 or 0, depending on whether r is odd or 
even. The proof also provides a description of the standard modules (the analogue 
of the Weyl modules) of the Brauer Schur algebra. 

Quasi-hereditary algebras also provide the basic setup for Ringel’s theory of 
characteristic tilting modules, which in the case of classical Schur algebras are 
certain direct sums of tensor powers of exterior powers, as explained in Sect. 1.2.1. 
Ringel’s theory implies that such modules must exist for Brauer Schur algebras as 
well; a detailed description of these modules is, however, still missing. 

Brauer algebras are, in general, not quasi-hereditary. As mentioned already, they 
carry a weaker structure, being cellular in the sense of Graham and Lehrer [21]. 
For a detailed discussion of the cellular structure of Brauer algebras, see [30]. 
Quasi-hereditary algebras A together with a duality (involutory anti-automorphism) 
preserving the heredity chain are cellular. Moreover, then the algebra eAe is cellular, 
too, where e is any idempotent fixed by the duality. The cell (Specht) modules of 
the algebra eAe are just the standard modules of A multiplied by e. In this way, the 
quasi-hereditary structure of the classical Schur algebra induces a cellular structure 
on the group algebra of the symmetric group, and the quasi-hereditary structure of 
the Brauer Schur algebra (but also that of S.n,)(G)) induces the cellular structure of 
the Brauer algebra that has been mentioned above. This provides further relevance 
for the Schur functors to be discussed in more detail in the final subsection below. 


3.3 Homological Properties, Covers and Uniqueness 


Next, we discuss homological properties of the Brauer Schur algebra. Given an 
algebra A, the minimal length of all finite projective resolutions of a given A-module 
is called its projective dimension. The global dimension of an algebra is by definition 
the supremum of the set of projective dimensions of all A-modules. Thus the global 
dimension is either a natural number or infinity. The algebra A has finite global 
dimension, say d if and only if there is non-vanishing cohomology between some 
A- modules in degrees 0,1,...,d, but from degree d + 1 all cohomology vanishes. 
The global dimension is not directly related to representation theory; for instance, 
there is no correlation with representation type (finite, tame or wild). However, 
it characterises classes of algebras: An algebra has global dimension zero if and 
only if it is semisimple; prominent examples of such algebras are group algebras 
where the characteristic of the underlying field is not dividing the order of the 
group. An algebra has global dimension less than or equal to one if and only if it 
is hereditary, that happens if and only if every submodule of a projective module is 
again projective; prominent examples of such algebras are path algebras of quivers. 
By Cline, Parshall and Scott [6, Theorem 4.4] and Dlab and Ringel [9, Appendix, 
Statement 9], all quasi-hereditary algebras have finite global dimension, and hence 
the Brauer Schur algebra has finite global dimension. 
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As we have seen, there is a close relationship between the classical Schur 
algebra A = S(n,r) (for n > r) and the group algebra ks’, of the symmetric 
group on r letters, which is of the form kX’. = eAe for some idempotent element 
e = e € A. Multiplication by the idempotent e defines an exact functor sending 
A-modules to eAe-modules. How strong is the relation provided by this functor, the 
Schur functor? In [19] it has been shown that another homological dimension, the 
dominant dimension of A, controls this situation for classical Schur algebras. The 
double centraliser property (Schur—Wey] duality) can be expressed as saying that 
the dominant dimension of A is at least two, and the quality of the Schur functor, 
when comparing cohomology of Weyl and Specht filtered modules, is measured 
in a precise sense by the dominant dimension of A. Classical dominant dimension 
requires Ae to be a projective—injective A-module. It is not known when @M(I, A) 
is projective—injective over the Brauer Schur algebra. Therefore, when trying to find 
an analogue of the results in [19] for the Brauer Schur algebra, one has to work 
with a relative dominant dimension, with respect to @M(I,A). The Schur—Wey] 
duality between the Brauer algebra and its Schur algebra is equivalent to this relative 
dominant dimension being at least two. The precise value of this relative dominant 
dimension so far has not yet been determined. 

Instead of starting with a given algebra A and asking for its relation with a 
particular centraliser algebra eAe, we may ask conversely: Given an algebra B, can 
we find a quasi-hereditary algebra A such that B ~ eAe? Can we even find the best 
possible A, that is, the one with the strongest connection to B? To investigate such 
questions, Rouquier [37] introduced the concept of quasi-hereditary covers. This 
concept is closely related to relative dominant dimension with respect to a projective 
module that is not required to be injective. More precisely, for a natural number i, 
an i-cover is defined in order to compare the cohomology up to degree i of modules 
of certain algebras. A quasi-hereditary algebra A is called a 0-cover of eAe for an 
idempotent e € A, if A and eAe satisfy a double centraliser property with respect to 
the bimodule eA. And A is a 1-cover of eAe with respect to the bimodule eA, if in 
addition, the category F4(A) of A-modules with a filtration by standard modules is 
equivalent via the Schur functor e - — to the category F.4-(Ae) of eAe-modules 
with a filtration by modules Ae. So, by the result of Hemmer and Nakano, the 
classical Schur algebra S(r, r) is a quasi-hereditary 1-cover of the group algebra k,, 
provided the characteristic of k is different from two and three. Rouquier showed 
that 0-covers always exist; he also showed that 1-covers, if they exist, are satisfying 
a universal property that makes them unique up to Morita equivalence. 


Theorem 3.3 ((27, Theorem 11.4(b) and (c)]) When the characteristic is different 
from two or three, the Brauer Schur algebra Sp(n,r) is the quasi-hereditary 1- 
cover of the Brauer algebra B,, and as such it is uniquely determined up to Morita 
equivalence. 


In this abstract setup, Brauer Schur algebras have come up for the first time, in 
[25]; the explicit construction in [27] is more general, since it does not need any 
assumption on the characteristic. 
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In the language as explained above, choosing idempotent e : @M(I,A) > 
M(0, (1")) = B, in Sg(n,r), the two algebras B, and Sg(n,r) are in Schur—-Wey] 
duality with respect to the bimodule e- Sg(n, r) = @M(1, A) where the sums always 
run over all possible permutation modules. By [24], needing the characteristic of 
the underlying field being different from two and three, we can identify extension 
spaces between modules with standard filtration over Sg(n, r) with extension spaces 
over B,: 


EXts, (7,7) (X,Y) = Ext, (eX, e¥), 


which implies that the Brauer Schur algebra is a quasi-hereditary 1-cover of the 
Brauer algebra B,. See [25, Sects. 11-13] for explicit statements and for more 
information. 


3.4 A Lie Theoretic Context for Brauer Schur Algebras 


We now return to working again in the context of orthogonal and symplectic groups, 
and hence choose again the parameter 5 of the Brauer Schur algebra to be n or —n, 
respectively. Schur—Weyl duality is given by a bimodule, and this in turn gives rise 
to two functors, the Schur functor and the inverse Schur functor. In the context of 
the classical Schur algebra, the tensor space is a bimodule for the classical Schur 
algebra S(n, r) and the group algebra of the symmetric group & which gives rise to 
the Schur functors and the inverse Schur functor: 


mod-G D mod-Sen,(G) = mod-S(n, r) =mod-Ends, (E®”) = mod-Endy, (@M*) 


f g 
mod-kS), 
with G = GL,(k) and functors f = Homg(E®’, —) and g = — @s, E®”. In case 


one of the involved algebras is semisimple, for example, when the characteristic of 
the underlying field is zero, these functors provide the equivalence between k»’,- 
mod and S(n, r)-mod; and otherwise, they provide an exact equivalence between 
the category of Weyl-filtered S(n, r)-modules and the category of cell-filtered kX,- 
modules, provided the characteristic of the underlying field is different from two and 
three. Thus Schur functor and inverse Schur functor are connecting a certain part of 
the representation theory of general linear groups, with that of symmetric groups, 
that is, the functors provide a connection between Lie theory and combinatorics. 
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The above diagram summarises the following results: 


— Classical Schur—Weyl duality states that S(n,r) = Ends, (E®’). 

— The Schur functor f and the ‘inverse Schur functor’ g (which is a one-sided 
inverse only) relate the representation theories of S(n, r) and of k&’,, specialising 
to an equivalence of categories between Weyl and Specht filtered modules (in 
characteristic different from two and three). 

— The Schur algebra (defined as endomorphism ring) equals the enveloping 
algebra Seny(G) (which is a quotient of the group algebra kG). 

— Tensor space is a relative projective generator of the category of modules with 
Specht filtration, and it is a direct sum of permutation modules. 


We consider now the same situation for Brauer Schur algebras. As in types B, C 
and D, tensor space is different from the sum of permutation modules over B,, there 
are now two sets of Schur functors related to this situation, a pair of functors (fo, go) 
related to the tensor space and a pair of functors (Fy, Gy) related to the full sum of 
permutation modules: 


to = Homg(E®", -), Fu => Hom,,(®M(/, i), —), 
80 = — @B, Ee Gu = — @sp(n,r) (BM, A)). 


The functors fo and go are the Schur functors relating Donkin’s orthogonal or 
symplectic Schur algebra S.,,(G) with the Brauer algebra. The functors Fy and 
Gy are the Schur functors relating representation theory of the Brauer Schur 
algebra with that of the Brauer algebra. By Theorem 1.1, we have Sg(n,r) ~ 
Endg(@yt,—20<1 <f Sym“). Define in addition functors related to the full sum 
of symmetric powers: 


Fs = Homg(@Sym*E, -), Gs = — @sz(n,r) (@Sym’E). 


These functors relate Donkin’s orthogonal and symplectic Schur algebras with 
the Brauer Schur algebras, when the parameter 6 of the Brauer Schur algebras is 
appropriately specialised. 

To summarise, we have the following triangle of functors, relating the representa- 
tion theory of the two different Schur algebras in the top row with the representation 
theory of Brauer algebra in the bottom row. In particular, by Theorem 1.1 we 
have obtained a Lie theoretic context for the representation theory of Brauer Schur 
algebras: 


Gs mod-Endg, (@ M(i, A)) 
mod-Seny(G) = mod-Endg, (E®”) C mod-G —_—, — =mod-55(n, r) 
Fs 


TA 


mod-B,. 
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This triangle commutes in the sense that Gs = gooGy and similarly for the adjoints. 
Indeed, there are isomorphisms of left Sg(n, r)-modules 


M(1, A) @z, E®” ~ M?* @x5,_», eB, @p, E®" ~ Sym’ E @ 8! ~ Sym*E. 


Tensor functor and Hom-functor are adjoint functors. Uniqueness of adjoints then 
implies Fs = Fy ° fo. Again, if one of the involved algebras is semisimple, then all 
involved module categories are equivalent. Of interest is now the question what 
happens when none of these algebras is semisimple. In analogy to the picture 
above in type A, when the ground field & has characteristic different from two 
and three, the functors Fy and Gy are mutually inverse equivalences between the 
exact categories of A-filtered Sg(n, r)-modules and cell filtered B,-modules, by 
(24]. Thus an analogue of the theorem of Hemmer and Nakano [26] mentioned 
above holds true. It is not known how well these two functors compare or identify 
cohomology in higher degrees. Following [19], where the classical type A case 
has been clarified, this amounts to saying that the dominant dimension of Sg(n, r) 
relative to @M (I, A) is not known. It is open whether the other two pairs of functors 
are restricting to equivalences between corresponding categories of filtered modules. 
Also, the (relative) dominant dimensions of generalised Schur algebras (including 
the enveloping algebras S,,,,(G)) are not known. In other words, it is not known how 
good the various Schur functors are in comparing cohomology. 
The above diagram summarises the following results: 


— There is a Schur—Weyl duality between the enveloping algebra Seny(G) and the 
Brauer algebra B,. The bimodule affording the Schur—Weyl-duality is tensor 
space. 

— There is another Schur—Weyl duality between the Brauer Schur algebra Sz(n, r) 
and the Brauer algebra B,. The bimodule is a full direct sum of permutation 
modules of B,. These are relative projective in the category of Specht filtered 
modules, which in general does not contain tensor space. 

— The enveloping algebra Seny(G) and the Brauer Schur algebra Sp(n,r) are in 
general different algebras. 

— There is a third Schur—Weyl duality between Seny(G) and Sp(n, r). The bimodule 
is a full direct sum of tensor powers of symmetric powers. In particular, Sz(n, r) 
is the endomorphism ring over G of this module (Theorem 1.1). 

— The Schur functor Gy and the ‘inverse Schur functor’ Fy (which is a one-sided 
inverse only) provide an equivalence of categories between Weyl and Specht 
filtered modules (in characteristic different from two and three). 

— The other two pairs of adjoint functors, (go,fo) and (Gs, Fs), relate the repre- 
sentation theories of the respective algebras, but are not known to specialise to 
equivalences of subcategories. 


As explained in Sect. 1, the Lie theoretic context provided here is strong enough 
to solve the classical problem of describing endomorphisms of sums of tensor 
products of symmetric powers. The precise nature of the connections given by the 
above triangle of functors remains, however, to be explored. 
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All of what just has been said refers to the case of Brauer Schur algebras with 


integer parameters 6 in order to have a link to orthogonal or symplectic groups. If the 
parameter 6 is not an integer, the Brauer Schur algebra still is quasi-hereditary and 
its module category is a highest weight category. No Lie theoretic object is known 
that can play the role of Sen» (G) or of G itself in the above triangle of functors. 
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Arithmetic Functions: A Pivotal Topic 
in the Scientific Work of Wolfgang Schwarz 


Karl-Heinz Indlekofer 


Dedicated to the memory of my academic teacher Wolfgang 
Schwarz 


Abstract The purpose of this article is to describe some questions which have 
arisen from discussions in the working group of Wolfgang Schwarz in the early 
1970s. We concentrate on problems concerning arithmetical functions and deal 
with investigations of almost-even, limit-periodic, and almost-periodic functions. 
We give a survey of relevant results by Schwarz and Schwarz—Spilker, respectively, 
and add corresponding contributions of the author. 


Keywords Almost-even and almost-periodic functions * Asymptotic results on 
arithmetical functions * Boundary behavior of power series * Compactifications 
of N and integration * Conformally equivalent series « Power series methods of 
summability * Ramanujan expansions * Uniformly summable functions 


2010 Mathematics subject classification: Primary 11K65; Secondary 11N37, 
11N56, 11N60, 11N64, 30B30, 40G05, 40G10 


1 A Selection of Mathematical Subjects Discussed 
in the Group of Wolfgang Schwarz in the Early 1970s 


When I came to Frankfurt in October 1969 as an assistant in the working group of 
Wolfgang Schwarz the initial phase was characterized by the involvement with the 
doctoral dissertation and preoccupation with various mathematical topics treated by 
Schwarz in his lectures and seminars. I could benefit a lot from his motivating and 
inspiring suggestions. In this article I will describe the main topics discussed in the 
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group of Wolfgang Schwarz during the early 1970s. I present the related results of 
Schwarz referring in many cases to the monograph [62] of Schwarz and Spilker. 
Additionally, I complement this survey by results which I could contribute to these 
topics. 


I. Equivalent power series 


In 1958, equivalent power series were introduced by Turan [63]. To this end let 
D := {z€C: |z| < 1} be the open unit disk and D := {z € C: |z| < 1} the closed 
unit disk of the complex plane, and denote by H(D) the algebra of all holomorphic 
functions in D. If ¢ € D, ¢ # 0, then 


_ 
MO) = Tor Tae 


defines a conformal and bijective mapping ¢; of D onto itself with oc(1) = 1. If 
f € H(D), then f, = f o d € H(D), and the Taylor series representing f and fj, 
respectively, are called conformally equivalent. 

Turan investigated the periphery-convergence behavior of equivalent power 
series, and he showed that the convergence is not conformally equivalent. More 
explicitly this means the following: he proved the existence of functions f € H(D), 
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the Taylor series 
CO 
fw) = DT anew (1.1) 
k=0 
of which converges for w = 1, whereas the power series 


fil2) = f(e(2) = YS bn(Oz" (1.2) 


k=0 


diverges for z = b '(1) = 1. Such a function is called konvergenz-schlecht (badly 
convergent) under ¢; (see Schwarz [51]). In 1967, Clunie [3] gave, for each fixed 
0 < |¢| < 1, an example of a function f € H(D), which is continuous on D, 
such that (1.1) converges for w = 1 and the transformed series (1.2) diverges for 
z = |. Inthe same year Schwarz [51] sharpened Clunie’s result by using the theorem 
of Banach-Steinhaus. In [52] he investigated functions f € H(D) which satisfy 
some certain modulus of continuity on the boundary of D and are badly convergent 
under ¢;. These results motivated and influenced my dissertation which is published 
in [13, 14, 16]. In [15] I could improve the result of Schwarz [52] by verifying a 
conjecture of Paul Turan. A more detailed discussion of this topic may be found in 
[38] and in the survey article [31]. 


II. Multiplicative functions of modulus < 1 


The results of Delange [9], Wirsing [66], and Haldsz [12] aroused the interest of 
Wolfgang Schwarz in the investigation of multiplicative functions. For arithmetical 
functions f : N > C let us denote the means of f by 


1 
M(f.x) = =f (n) 


nsx 


and call 
M(f) := lim M(f,x) (1.3) 


the mean-value of f if the limit (1.3) exists. Delange [9] essentially characterized 
those multiplicative functions f, |f| < 1, for which the mean-value M(f) exists 
and is different from zero. Wirsing [66] showed in 1967 that every real-valued 
multiplicative function of modulus < | possesses a mean-value and solved an old 
conjecture of Erd6és and Wintner: Any multiplicative function f, assuming only the 
values 1,0, and —1, possesses a mean-value. Choosing f = kj, the Mobius function, 
this result implies the prime number theorem. 

One year later Halész [12] gave a complete description of the asymptotic 
behavior of M(f,x), || < 1, and showed: 
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Let f : N > C be multiplicative of modulus |f| < 1. Then there exist constants 
Cc, ag and a slowly oscillating function L of modulus |L| = 1, so that the asymptotic 
formula 


wi) = cx!*07 (log x) + o(x) 


nsx 


is true. 

Motivated by papers of Novoselov [47, 48] which we discussed in our Seminar 
(1971/72), the question was posed whether, for functions f occurring in Haldsz’s 
result and possessing a mean-value M(f), this mean-value can be represented as an 
integral. 

The idea of Novoselov was, briefly summarized, as follows: 


— The ring Z of the integers is embedded into the compact topological ring S of the 
polyadic numbers. 

— Then, on the additive group of the ring S, as a compact group, there exists a 
normalized Haar measure P defined on a o-algebra A which contains the Borel 
sets in S such that (S,.A, P) is a probability space and P is the extension of the 
asymptotic density. 

— This enabled Novoselov to develop an “integration theory” for the space of limit- 
periodic functions, i.e., for arithmetic functions which can be approximated by 
periodic functions with integer period. 


Then the posed question may be formulated as 
Problem: Letf be a multiplicative function such that the mean-value M(f) exists. 
Is it possible to represent M(f) as an integral 


min = [ Fp 


N 


where N is a compactification of N, f is an extension of f to N, and p is an 
appropriate measure on N? 


Ill. Spaces of arithmetical functions 


This topic is closely connected with the problem of extending the results of Delange, 
Wirsing, and Halasz, respectively, to “unbounded” (multiplicative) functions. 

To start from the beginning we denote by B and D the spaces of all even functions 
and all periodic functions, respectively. The base for further questions was the very 
nice paper by Schwarz and Spilker [61] in 1971. They showed that B is equal to the 
space of all linear combinations of Ramanujan sums c, where 


cr(n) = > €a/r(n) 


l<a<r 
gcd(a.r)=1 
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and every f € D can be written as a linear combination of the exponential functions 


Ca/r> 
a 
€a/r(n) = exp(2mi-n) (a,r € N) 
r 
1.€., 


B = Linc[c;: r = 1,2,...] 


D = Lincleajr 2 r = 1,2,...;1 <a <r, gcd(a,r) = 1). 
In addition, define the vector space 
A := Lincle, :a@ € R mod Z| 


of complex linear combinations of e,. Taking the closure of B,D, and A with 
respect to the supremum norm ||.||,,, defined by 


If llu = sup|f(n)| for f € £°, 


neN 
we atrive at the spaces 
BY": |\.\|, closure of B, uniformly almost — even functions, 
D" : ||-llu closure of D, uniformly limit — periodic functions, 
A”: ||.||, closure of A, uniformly almost — periodic functions. 


These three spaces are the starting points for further investigations. They are B*- 
algebras 


BYCcDY cA" Cc l™ (1.4) 


and Gelfand’s theory can be applied. Further, the mean-value M(f), the Fourier- 
coefficients M(fe,), and Ramanujan-coefficients M(fc,) exist for all f € A” and 
motivate questions concerning Fourier- and Ramanujan-expansions. The character- 
ization of multiplicative functions belonging to these spaces is an obvious question. 

When J. Knopfmacher held a visiting professorship in Frankfurt for a period 
during 1973 he lectured on many topics of his “Abstract analytic number theory” 
and discussed the question to transfer the above described theory to arithmetical 
semigroups. In Chap.7 of his book [40] he describes the generalized Fourier 
theory of even and almost-even functions. Multiplicative functions on arithmetical 
semigroups and analogues for Haldsz’s theorem, etc., and similar questions are 
handled, for example, by Indlekofer and Manstavicius [35, 36] by Barat et al. [2] 
and by Indlekofer and Kaya [34], but here we will not deal with these topics. 
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The interest of Wolfgang Schwarz focussed further in the completion of B, D, 
and A with respect to the semi-norm 


1/q 
; 1 
IFlla := (isp > ire) 4q2) 
X—>0O 


nsx 
which he denoted by 
BY’, D1, and A’, respectively, 


the spaces of g-almost-even, q-almost-limit-periodic, and q-almost-periodic func- 
tions. Here again the mean-values M(f), M(fe,,), and M(fc,) exist for every f € A’. 
In this context he dealt with questions about 


(i) (general) properties of g-almost-periodic functions, 
(ii) characterizations and properties of multiplicative functions from BY’, ‘D4, 
and A’, 
(iii) pointwise convergence of Ramanujan expansions for multiplicative functions. 


In this paper we describe the results obtained for (ii) and (iii) which can be found 
in the monograph [62] by Schwarz and Spilker and add some aspects which are not 
included in the book (cf. Preface of [62], p. xiii). For example, we deal with the ||.||1- 
closure of £%, the space £* of uniformly summable functions (cf. [62]) and give the 
characterization of the multiplicative functions f € £* (independent whether the 
mean-value M(f) exists or not). We describe the answer of Schwarz and Spilker 
to the problem of representing the mean-value as an integral and compare it with 
a method introduced by me in 1992 (see [25], Preface of [62], p. xiii, and [60], 
Sect. 6.3). 


2 Characterization of Multiplicative Functions 
inBIc Dic A%c L* 


2.1 Multiplicative Functions in L* with Non-zero Mean-Value 
For g = 1 let 
Lt ={f:N>C,|lfllq < c0} 
be the vector space of functions f with bounded semi-norm || f||,. Obviously 
MeMcA ch el cx 7 ga, 


In 1980, I [17] could prove 
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Theorem 2.1 Let f : N > C be multiplicative and q => 1. Then the following two 
assertions hold: 


(i) Iff ¢ £* 1 £4 and if the mean-value M(f) exists and is different from zero, 
then the series 


-1 — 1/7 
y f(p) y = | (2.1) 
|f(p)l<3 


and 


s Ae yy te i (2.2) 


P Pp k>2 
f)-1l=4 


converge for all A with 1 < |A| < q and, for each prime p, 


oo k 
rey! a. (2.3) 


(ii) If the series (2.1) and (2.2) converge, then f € £* N £4, and the mean-values 
M(f), M(|f|*) exist for all A with 1 < A < gq. If in addition (2.3) holds, then 
M(f) # 0. 

Elliott proved the assertions of Theorem 2.1 for g = 2 in 1975 [10] and for g > 1 
in 1980 [11]. Daboussi [5] published a proof for the case g > 1 in 1981. A general 


characterization of uniformly summable multiplicative functions will be described 
in Sect. 2.4. 


2.2 Multiplicative Functions in B41, D4, and A‘ with Non-zero 
Mean-Value 


It turns out that these functions are just the same as those in 2.1. From [62] we cite 


Theorem 2.2 ((62], Chap. VII, Theorem 5.1) Let f be a multiplicative arithmeti- 
cal function with mean-value M(f) # 0. Assume that q => 1. Then the following 
statements are equivalent: 


(1) The series (2.1) and (2.2) converge for A = q. 
(2) f € BY, 
GB) feds 
(4) fe AY 


186 K.-H. Indlekofer 


If q = 2, then (1) to (4) are equivalent to 
(5) Illa < ©. 


Furthermore, in any case and for every prime p, 


l+p 'f(p) +p f(p?) +... #0. 


Obviously each of conditions (1)—(4) is equivalent tof ¢ £2 L£*, and is equivalent 
to || fllg < 0, if¢ > 1. 


2.3 Multiplicative Functions in B41, D1, and A‘ with 
Mean-Value Zero 


If the mean-value equals zero the situation differs considerably from the previous 
one and leads to results distinguishing between g-almost-even and q-limit-periodic 
functions. For this we define the Fourier-Bohr spectrum o(f) of f by 


o(f) := {a € R/Z: limsupx "| SY F(n) exp(—2sina)| > O}. 


n<x 


The Ramanujan spectrum o* (f) of f is the set 


o*(f):= {re N: limsupx |} f(n)e-(n)| > 0}. 


nsx 


In 1981, 1 [19] proved the following results: 


Theorem 2.3 Let f : N — C be multiplicative and q = 1. Then the following 
assertions are equivalent: 


(i) f € BY and ||f ||; > 0. 
(ii) f € B4 and the spectrum o* (f) of f is non-empty. 
(iii) f € £11 £&* and there exists a principal Dirichlet-character yo so that the 
mean-value M(f Xo) of f Xo exists and is different from zero. 
(iv) The series (2.1) and (2.2) converge for all i with 1 < A < q. 


The case | f| < 1 was handled by Schwarz and Spilker in [62], Theorem 5.1. The 
main content of Theorem 2.3 may be found in the Exercises 2 and 3 of VII.7 in [62]. 


Theorem 2.4 Let f : N — C be multiplicative and q => 1. Then the following 
assertions are equivalent: 


(i) f € D4 and || fli > 0. 
(ti) f € D4 and the spectrum o(f) of f is non-empty. 
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(iii) f € £9 &* and there exists a Dirichlet-character y so that the mean-value 
M(f x) of fx exists and is different from zero. 
(iv) There exists a Dirichlet-character y such that the series 
-—1 —1/? 
y F(p)x(p) = 1 y lf P)xXP) = 1 (2.4) 
p 


Pp P P 
\fls¥ 


and (2.2) converge for all X with 1 <2 < q. 


The equivalence of (ii) and (iv) was proved by Daboussi [4]. Schwarz and Spilker 
deal with this equivalence in Theorem 6.1 of Chap. VII in [62]. 

In the proof of Theorem 2.4, I used the fact that if f € £* is multiplicative then 
o(f) contains only rational values. This result is best possible in the sense that there 
are multiplicative functions f € £!\L* such that o(f) contains irrational values. 
Further, the result says that if f ¢ A’ is multiplicative, then f ¢ D?. Daboussi (see 
[6] and [46]) gave a proof of this result for bounded multiplicative functions, where 
Daboussi—Delange’s paper [7] treats the case f € £7. In Lemma 6.3 of Chap. VII in 
[62] Schwarz and Spilker give a proof for f € A!. 

Further generalizations may be found in the papers [32] and [33] by Indlekofer 
and Katai. 


2.4 Multiplicative Functions in £* 


Using the same proof as in [17] one can prove the following characterization: 
Theorem 2.5 Let f € £* be multiplicative and || f\|, > 0. Then the series 


- 3 lf(p)| (2.5) 


P Pp 


D P 
IA(PI-1>4 If(P—U>4 


and (2.2) converge for 4 = 1, and there exists a sequence | < x1 <x. <... 70 
such that the partial sums 


=a | 
y If a (2.6) 
PSXk 


are uniformly bounded (k = 1,2,...). 


This implies, with the method given in [17], 
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Theorem 2.6 Let f € £* be multiplicative and ||f||1 > 0. Then there exists a 
constant c > 0 such that 


e* > Feo) = oo 


n<x 


-1 
a Mun =) (c + o(1)) (2.7) 


psx 


asx > ©. 


Since the sequence {M(|f|,7)} of the means is bounded, (2.7) implies that there 
exists a constant c’ such that 


> \f@l= 1 <c for all x>2. (2.8) 
P 


psx 


Repeating the arguments used in [17], Sect.4, word by word, one can show that 
(2.8) and the necessary conditions in Theorem 2.5 imply that f € £* and || f||; > 0. 
Further, one concludes that || f||; = 0 if and only if one of the series in (2.2) or (2.5) 
diverges or the sum in (2.6) tends to oo for all sequences {x;}, x, > ©. 

A general representation for multiplicative functions in £* is given in 


Theorem 2.7 Let f € £* be multiplicative and ||f ||; > 0. Then either the mean- 
value M(f) exists and equals zero or there exist constants ag € R and0 4 co € C 
such that, as x —> oo, 


el >“ f(n) — xia exp (x fen) (co + o(1)). 


n<x psx 
If 
‘ Imf (p)p— 
Poy 
psx P 
then 


lim sup |A*(y) —A*(x)| = 0 


x<y<x? 


If the mean-value M(|f|) exists, then the proof of Theorem 2.7 can be found in 
[18]. For the general case, see [22]. 

As a consequence of Theorem 2.7 we mention a generalization of the cited result 
by Wirsing. 


Corollary 2.8 Let f € L£* be real-valued and multiplicative. If M(|f|) exists, then 
M(f) exists. 
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Concluding remarks For a quantitative form of Wirsing’s result we refer to [30]. 
The connections of the above results with the existence of the limiting distribution of 
multiplicative functions and the characterization of additive functions are described 
in [20, 21, 24] and in [23], respectively. In the final chapter of [62] Schwarz and 
Spilker present the elementary proofs of the cited Halasz’s theorem by Daboussi 
and Indlekofer [8]. For supplementary informations see [26] and [29]. 


2.5 Pointwise Convergence of Ramanujan Expansions 
for Multiplicative Functions in B' 


With their paper [61] Schwarz and Spilker start up a Fourier theory for almost- 
even functions. For example, the set {(y(r))~2¢; :r = 1,2,...} is a complete 
orthonormal system in 8? and Parseval’s equation holds. Every f € B! possesses a 
Ramanujan expansion 


f= Va(her 
where the (Ramanujan) coefficients a,(f) are given by 


1 
a-(f) = one 


Therefore it becomes of interest to investigate the pointwise convergence to f of 
the associated Ramanujan series. From 1973 to 1975 Schwarz dedicated himself to 
this problem in the case of multiplicative functions (see [53—59]) and succeeded to 
verify the following: 


Theorem 2.9 Let f be multiplicative of modulus |f| < 1. If M(f) # 0, then its 
associated Ramanujan expansion is pointwise convergent and 


CO 


Y-a,(fer(n) =f(n) for any neN. (2.9) 


r=1 


In 1980, Tuttas [64] extended this result to multiplicative functions with non-zero 
mean-value belonging to B. In 1983, using [17], Warlimont [65] showed that if 
f € B! is multiplicative and if the series (2.1) and (2.2) converge, then (2.9) holds. 
Bearing in mind Theorem 2.1 and by Warlimont’s result we get immediately the 
following refinement of Theorem 2.9: 


Theorem 2.10 Let f ¢ B! be multiplicative and || f\|; > 0. Then the assertion of 
Theorem 2.9 holds true. 
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There are some fascinating examples of pointwise convergent Ramanujan expan- 
sions for particular functions of interest, which cannot be explained in the previous 
context. For example, the following formulas for the divisor function d(n) and the 
counting function r(n) of the numbers of representations of n in the formn = a?+b? 
(a, b € Z) have been established by Ramanujan [50]: 


co 


logk 
d(n) =-)~ ~ cx(n) (2.10) 


k=1 


r(n)=az 3 cod as Cor—1 (n) (2.11) 
= > 2k 1 . 
— el 


The fact that the formulae do not fit within the previous framework is obvious 
in the first case since the mean-value of d does not exist. In the second case, the 
function r has the form r = 4(1 * x) with the non-principal Dirichlet-character y 
mod 4. The mean-value M(r) exists and equals z, so that (2.11) has the general 
appearance as the formula considered in Theorem 2.9. Further, it shows that r € £! 
but it turns out by Theorem 2.1 that r is not uniformly summable. 

A concept which can handle such “mysterious” Ramanujan expansions and 
which is not based on almost-even functions has been introduced by Lucht [43] 
in 1993. In his article [44] he gives a detailed description of this topic. 


3 Integration 
3.1 Integration via Gelfand Representation 


The algebra B” is a commutative algebra with identity which is closed under the 
involution f — f (complex conjugation). Therefore the Gelfand Representation 
Theorem is applicable and implies that B“ is isomorphic and isometric to the algebra 
C(X) of all continuous complex-valued functions on a certain compact Hausdorff 
space X—its maximal ideal space, denoted by X = Ag. The following theorem, 
which does not depend on the Gelfand Representation Theorem, provides slightly 
sharper information as to how (uniformly) almost-even functions can be represented 
by continuous functions on an explicit compact space. For this Schwarz and Spilker 
[61] introduced the compactification N* of N by 


Fa 
N*=[], 
P 
prime 


where Np the one-point compactification of the discrete (and locally compact) space 
(1,p,p’,...). 
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Theorem 3.1 Let 
1:N—N* 
be the embedding of N into N*. Then the induced homomorphism 
1*: C(N*) > €%, 1*F= Fo 


maps C(N*) isomorphically and isometrically onto B". 


Knopfmacher [40] transferred this theorem to arithmetical semigroups. In [62], 
Chap. IV, Schwarz and Spilker describe the Gelfand’s theory for the algebras B“ 
and D", 

In the case D“ of (uniformly) limit-periodic functions they show that the maximal 
ideal space Ap of D”“ is homeomorphic to the Priifer Ring Z. In order to define Z, 
first consider any n and the residue class ring Z/nZ with the discrete topology. Now 
form the topological product X = [| Z/nZ. Finally, consider the natural projection 


Inn: L/LZyn > Z/Zm, if min 
and define the compact space Z by the projective limit 
Z:= {(Qn) €X in € Z/Zy and Tmn(On) = Om, if mln}. 
Mauclaire [45] showed that the Bohr compactification of Z is the appropriate space 


for corresponding investigations of (uniformly) almost-periodic functions. 


Remark The compactification Z was given by Priifer [49] in 1925. In 2002, Kubota 
and Sugita [42] introduced a compactification Z of Z by Z = [[Z, where Z, 


Pp 
denotes the ring of p-adic integers. It turns out (see Barat et al. [1]) that the compact 
rings of Priifer, Novoselov, and Kubota—Sugita are homeomorphic to each other. 


The construction of Schwarz and Spilker for an “integration theory on B“” is now 
straightforward. 
The mean-value M(f) of f is a nonnegative linear functional on B". It is bounded 
(and continuous) and extends uniquely to C(N*) by 
M* : C(N*) > C 
M*(F) = M(f 01) for every F € C(N*). 


Then M* is a bounded, nonnegative linear functional and Riesz’s Theorem yields 


Theorem 3.2 (see [62], Chap. IV, Theorem 6.1) There exists a complete and 
regular probability measure 6, defined on a o- algebra A, containing the Borel 
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sets of N*, with the property 


[Fai = M*(F) = M(Fo1) 


N* 
for every F € C(N*). 


The measure 4 and the o -algebra A can be characterized as a product measure 
on N* = [[N, and via the product o-algebra of N*, respectively. For details, see 


[62], Chas 1 

In 1976, Knopfmacher [41] showed that the quotient space 64/ nullspace 
corresponds to L4(N*, 8), and that the whole theory can be extended to arithmetical 
semigroups. 

Similar conclusions as in Theorems 3.1 and 3.2 can be done for the B*-algebras 
D* and A“. But Schwarz and Spilker did not follow up this topic. It should be 
alluded the special role played by the asymptotic density in the investigations 
of the spaces B“, D“, and A”. Further, despite of the ad hoc construction of the 
compactifications, the “size” of these spaces is “restricted”; the Mobius jz function, 
for example, is not an element of any of these spaces. To avoid these limitations the 
author described in the 1990s an “integration theory” which is based on the Stone— 
Cech compactification of the natural numbers. 


3.2 Integration via the Stone-Cech Compactification 


Suppose that A is an algebra of subsets of N, i.e., 


Gi) NEA 
(ii) ABE ASAUBEA 
(iii) ABE ASA\BEA 


Then, if € denotes the family of simple functions on N, the set 


E(A) := {s€ Es = ala; ao EC, AVE A, f= l,...,m} 


j=l 


of simple functions on A is a vector space, especially an algebra, such that €(.A)", 
the ||.||,, -closure of €(A), is a B*-algebra. 

The main difficulties concerning the immediate application of probabilistic tools 
to the investigation of additive and multiplicative functions in number theory arise 
from the fact that the asymptotic density defines only a finitely additive measure 
on the family of subsets of N having an asymptotic density. To overcome these 
difficulties we proceed as follows: N, endowed with the discrete topology, will be 
embedded in a compact space BN, the Stone-Cech compactification of N, and then 
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any algebra A in N with an arbitrary finitely additive set function, a content or 
pseudomeasure on N, can be extended to an algebra A in BN together with an 
extension of this pseudomeasure, which turns out to be a premeasure on A. The 
basic necessary concepts are summarized in the following three propositions: 


Proposition 3.3 There exists a compactification BN of N with the following 
equivalent properties: 


(i) Every mapping f from N into any compact space Y has a continuous extension 
f from BN into Y. 
(ii) Every function from £° has an extension to a function in C(BN). 
(ili) For any two subsets A and B of N, 


ANB=ANB 


where A = clpnA and B= clpnB are the closures of A and B in BN, 
respectively. 
(iv) Any two disjoint subsets of N have disjoint closures in BN. 


An immediate consequence of (iii) is the following statement: 
Proposition 3.4 The compactification BN of N has the following property: 
(v) For any algebra A in N, the family 


A:={A:AeEA} 


is an algebra in BN. 


This property is equivalent to properties (i)—(iv) of Proposition 3.3. 

It should be observed that BN is unique in the following sense: if a com- 
pactification N of N satisfies any one of the listed conditions, then there exists a 
homeomorphism of BN onto N that leaves N pointwise fixed. 

As a consequence of Property (i), we obtain the following: 

The identity mapping z : N — N is a continuous monomorphism, which sends 
N onto a dense subset of 6N, such that the adjoint homomorphism 


i* : C(BN) > £%, i*(f) = fot, 


maps C(6N) isomorphically and isometrically onto £~. 

We are now in position to formulate the following fundamental result: 
Proposition 3.5 Let A be an algebra in N and 6 : A — [0, 00) be a content on A. 
Then, the map 


5: A= [0,00), 8(A) = 8(A), 
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is o-additive on A and can uniquely be extended to a measure on the minimal o- 
algebra o(A) over A. 


We remark, as an immediate implication of the above construction, the following: 


(i) Every finitely additive function on an algebra A in N can be extended to a 
finitely additive function on the algebra of all subsets of N. 

(ii) Every linear functional on the vector space E(.A) can be extended to a linear 
functional on 2%. 


Let us now assume that every A € A possesses an asymptotic density 6(A) and 
define: 

For a given algebra A and for g > 1 denote by £*4(A) the ||.||, closure of E(A). 
A function f € £*4(A) is called q-uniformly (A)-summable. By L*4(A) we denote 
the quotient space £*4(.A) modulo null functions. 

A first consequence of the above construction is that, for all s € E(A), 


M(s) 7 | 3dé 


where s : BN — C denotes the extension of s. 
Starting from this one arrives at integrable function on the probability space 


(BN, o(A), 6) and 
L!(8) := L'((BN, (A), 8) = {f : BN > C, || fl] < 00} 


where 
lls hie 
BN 


There exists a vector-space isomorphism between the spaces L*(A) and L! (8) 
(for details cf. [27-29]). The same conclusions hold for L*4(A) and L4(8) if q>l. 

We shall apply the outlined construction to three examples and establish the rela- 
tions to the results of Schwarz and Spilker. Further, we show how our construction 
of integration is related to Gelfand’s theorem. Finally, we determine a complete 
orthonormal system for a Hilbert space whose elements can be approximated by 
linear combinations of multiplicative functions. 


Example 1 Let A; be the algebra generated by all zero residue classes. The 
asymptotic density is a content on A), 


E(A,)" = Be, 
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and 
L*4(A,) = B4/nullspace, q > 1. 


Example 2 The algebra A, generated by all residue classes, is our second 
example. Here again the asymptotic density is a finitely additive function on A, 
and 


E(Ad)" =D". 


For q > 1, L*4(A2) ) is isomorphic to the L1-space of Novoselov’s integration 
theory. 


Example 3 Let be a multiplicative function which assumes only the values —1, 0 
and 1, and define the sets 


A; := {n: f(n) = 1} 
Ay := {n: f(n) = 0} 

and 
A; := {n:f(n) =—]} 


with characteristic functions f*, f°, and f—, respectively. Obviously 


1 
f= sth 
fai-ft fr 
_ ol 
f = 5(lfl-#. 


We define the algebra A; to be the algebra generated by the sets A; AY, Ay for all 
multiplicative f with f(N) Cc {—1,0, 1}. An arbitrary set A of A3 has a characteristic 
function which is a linear combination of such multiplicative functions, and thus, 
by Wirsing’s theorem, the asymptotic density 5(A) exists. This leads to the spaces 
L*4(.A3) the elements of which can be approximated by linear combinations of 
multiplicative functions. 


The aforementioned connection with Gelfand’s theory is described in the follow- 
ing theorem and yields a generalization of Theorem 3.2. 


Theorem 3.6 (see Indlekofer and Wagner [37]) Let F be a B*-algebra of 
complex-valued functions from €* and let L be a positive linear functional on F 
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with L(y) = 1. Then there exists an algebra A of subsets of N and a content 5 on 
A such that 


(i) eachf € F belongs to the ||.||,-closure of E(A) and 
(ii) for eachf € F the relation 


Lif) = | fab 
! 


holds. 


3.3 Complete Orthonormal Systems 


Starting with BY“, ‘D“, and A” an application of Theorem 3.6 leads in any case to 
some algebra the elements of which possess an asymptotic density. 
Let A be such an algebra. Then, one can define an inner product on L*?(A) 


<f,g >= M(fa), fig € L(A), 
and by this L*?(A) is a Hilbert space. In [62] Schwarz and Spilker described 
complete orthonormal systems for their respective spaces and proved 
Theorem 3.7 (see Corollary 3.2 in [62], Chap. IV) 
(i) The set {(p(r))“\/? + c,, r = 1,2,...} is a complete orthonormal system in 
2 
(ii) The set {ea/rs r=1,2,..., 1<a<vpr, gcd(a,r) = 1} is a complete 


orthonormal system in D?, 
(iii) The set {eg :a@ € R mod Z} is a complete orthonormal system in A’. 


Here I sketch the idea how to determine a complete orthonormal system for the 
Hilbert space L*?(.A3). For this let 


R:={f:N—->R; f multiplicative and f(N) c {-1, }}} 


be the set of all multiplicative functions assuming the values +1, and define a 
relation ~ on R by 


1 
f~g iff \° —<o. 
ae 
St (Pap) 


Obviously ~ is an equivalence relation. Now choose a representative from each 
equivalence class and define F; as the complete set of all representatives. If f, g € 


Arithmetic Functions: A Pivotal Topic in the Scientific Work of Wolfgang Schwarz 197 


Fi, f # g,1.e.,f and g are not in the same class, then f(p)g(p) takes only the values 
+1, and 


y 1~f(p)s(P) _ 


z p 


By Wirsing’s theorem M(fg) = 0. Further, if f € F;, then f? = 1 and M(f”) = 1. 
This shows that 7; is an orthonormal system (but not complete). Choosing the 
normalized Ramanujan functions c* = (g(r))~'/*c, we put 


FroiafecifeF, r=1,2,..3 


which turns out to be a complete orthonormal system for L*”(.A3). More details can 
be found in [39]. 
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Aleksandar Ivié 


Dedicated to the memory of Prof. Wolfgang Schwarz 


Abstract In this overview paper, presented at the meeting ELAZ2014, Hildesheim, 
July 28—August 1, 2014, we present some selected works of the eminent mathemati- 
cian Wolfgang Schwarz. This choice is personal and reflects the common research 
interest of the author and Prof. Schwarz. 


Keywords Arithmetic functions * Asymptotic formulas * Riemann zeta- 
function * Tauberian theorems 


2010 Mathematics subject classification: Primary 11N37; Secondary 11N45, 
11N56, 11P82, 40E05 


1 Introduction 


Wolfgang Karl Schwarz (21 April 1934-19 July 2013) was a prolific and eminent 
German mathematician, working primarily in number theory. This paper presents 
some of his number-theoretic works, the choice being influenced by the common 
research interests of Prof. W. Schwarz and the author. 

I had the pleasure of meeting W. Schwarz for the first time during the Oberwol- 
fach meeting on “Elementary and Analytic Number Theory” in November 1980, 
although I started corresponding with him a few years before. W. Schwarz (together 
with H.-E. Richert and E. Wirsing) was one of the organizers of these Oberwolfach 
meetings at the “Mathematsches Forschungsinstitut” for many years. 

Before I met him, we already wrote a joint paper on some number-theoretic equa- 
tions (see [24] and Sect. 7.2). Definitely, Prof. Schwarz had a strong mathematical 
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influence on me. For example, the paper [25] written jointly with E. Kratzel, M. 
Kiihleitner, and W.G. Nowak was written after a suggestion from W. Schwarz. He 
had a vast mathematical culture, a clear and detailed style, and many mathematical 
subjects which interested him. Besides, he was a man of the world, a true gentleman 
indeed. He was also dedicated to mountain climbing, chess, and was a very fine 
musician; I am sure he could have made a very good professional career in music. 
Wolfgang Schwarz was a prolific author, covering many fields of number theory. 
The Mathematical Reviews list more than 110 of his publications. These include 


— Papers on arithmetic functions (multiplicative, additive, and particular) 
— Papers involving prime numbers 

— Papers on almost periodic and almost-even functions and related topics 
— Tauberian theorems (with applications) 

— Monographs 

— Historical papers 

— Textbooks 

— Other works 


The content of this paper covers roughly some of the topics mentioned above. 
This volume contains some other papers on the life and work of W. Schwarz, also 
presented at the Hildesheim ELAZ meeting in 2014. These are works by my friends 
and colleagues Indlekofer [15], Lucht [30], and Steuding [60]. These authors also 
recently published an obituary of Schwarz [16]. 

As to the quality of Schwarz’s work, one could perhaps mention the excerpt 
from the review [MR2405612 (2009d:11001), A. Laurin¢ikas] of Schwarz’s paper 
[45], “Some highlights from the history of probabilistic number theory”: “...is an 
excellent survey on probabilistic number theory, with principal results and methods, 
and respective comments, accompanied by a list of more than 250 references and 
by photos of some of the mathematicians mentioned.” 

The last time I saw W. Schwarz was during the meeting “ELAZ2012” (“Ele- 
mentare und Analytische Zahlentheorie,’ at SchloB Schney (Bavaria), August 
13-18, 2012. He gave there a brilliant talk on the famous mathematician Theodor 
Schneider (1911-1988), and the mathematical life in Germany in the 1930s. 
Without passing judgements, he described in detail (with photos, documents, etc.) 
the turbulent times of Germany in the 1930s and the different fates of some 
mathematicians. This was his last public appearance. He died on July 19, 2013. 
With his passing the number-theoretic community, especially the German one, has 
lost one of its most distinguished members. 


2 On Some Arithmetic Functions 


W. Schwarz has written numerous important papers on arithmetic functions, 
especially on almost periodic and almost-even functions and related topics (see, 
e.g., the papers [48, 51, 52, 55-57] and [39] for some of the more recent works 
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on this subject which Schwarz wrote alone or with other authors). This, however, 
falls outside the scope of this paper. Here we shall analyze some of his works on 
particular arithmetic functions, although many papers have to be omitted, like the 
works on “B-Zwillinge” (numbers n such that both n and n + 1 are a sum of two 
integral squares) [17] (with K.-H. Indlekofer) and [46]. We begin with some of W. 
Schwarz’s work from 1960 to 1970. 


2.1 The Euler Function 


In [40] W. Schwarz obtains asymptotic formulae for the sums 


Yi e(f@). >> e(f). 


nSx px 
where 
1 
g(m) =m I] 1- |= =: 1 
p\m F n<&m,(n,ym)=1 


is Euler’s (totient) function, p denotes primes, and 
f(n) = aon’ +--+ + ax (> 0) 


is a polynomial (primitive, i.e., (ao, @1,... , ax) = 1) with integer coefficients and 
discriminant D > 0. In the paper he has ten lemmas (one is the Siegel—Walfisz 
form of the prime number theorem) and six theorems. For example, he also proves 
(w(n) = >> 1 = number of distinct prime factors of m) the asymptotic formulas 


pin 
x x log log log x 
> o(f(e)) = [ loglogx + 0 —S 
Pee og x ogx 
6 
ba o(f(n)) = —a* loglogx + O(x), 
n&x,p(n)A0 


where the condition jz(n) 4 0 ((n) is the Mobius function) means that the last sum 
is over square-free numbers. One notes at once that there are several footnotes in the 
text. Footnotes are definitely a trademark of Schwarz’s mathematical style, and he 
was fond of using them! For example, on page 50 he has the definition of a strongly 
additive function as “Footnote No. 3”: 


3F(n) heiBt stark additiv, wenn fiir teilerfremde Zahlen nj, n, stets F(njny) = F(n) + 
F(nz) gilt und fiir jede ganze Zahl r > 1 stets F(p’) = F(p) ist. 
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3  Tauberian Theorems 


3.1 Habilitationsschrift 


Tauberian theorems (Alfred Tauber, 1866-1942) are (partial) converses of abelian 
[o.@) 

theorems (Niels Henrik Abel, 1802-1829). Namely, if the series )> a, converges to 

A, then Abel proved that 


n=0 


Tauber [61] in 1897 proved, if na, = o(1) as n — ox, then the converse is also true. 
The qualifying Tauberian hypothesis “na, = o(1)” can be relaxed to a, = O(1/n), 
say by Hardy—Littlewood [7, 8]. The Habilitationsschrift [41] of W. Schwarz uses 
Tauberian theorems of Hardy—Littlewood [7, 8], Hardy—Ramanujan [9, 10], Ikehara 
[14], Ingham [18], Karamata [26, 27], Martin—Pitt-Wiener (see [31, 34, 35]), and a 
host of other methods and results. 

For an extensive account on Tauberian theorems, see the monograph [28] of 
Jacob Korevaar. The Habilitationsschrift [41], in published form [42], is perhaps 
Schwarz’s most comprehensive and deepest work. It deals primarily with the 
evaluation of the summatory function 


YiF@) (3.1) 


nSx 


for multiplicative functions f(n) (f(mn) = f(m)f(n) for (m,n) = 1), under the 
assumption that 


>“ f(p) log p = Kxlog* x + R(x), 


px 


and R(x) (= “Restglied”’) is the remainder term. The asymptotic evaluation of (3.1) 
is one of the most fundamental problems in the theory of arithmetic functions. 

The case a < 0 was settled by E. Wirsing [62] in 1961 by intricate methods, but 
here also the (more difficult) case a > 0 is considered. Briefly, the multiplicativity 
of f(n) leads to the Euler product representation 


Sofa =[] (+f) +f(p?)p +---) (Res > 1). 


n=1 Pp 
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A Tauberian theorem of Hardy and Ramanujan [9, 10] gives first an evaluation for 


log | Erin} ; 


n&x 


Ingham’s Tauberian theorem [18] gives then )° f(n)/n. Finally an idea of E. Wirs- 
n&x 


ing produces an evaluation of )~ f(n). The assumptions are: R(x) = O(x),f(p) = 


n&x 


o(plog*! p), f(p") < yp"! *k-> for some 6 > 0 and k > 2. 


3.2 Applications of Tauberian Theorems 


The dissertation is essentially published in three important works [42], Part A, B, 
and C. Part B brings forth the asymptotic formula for 


1 
P(x) = Do ae" ela Ne 


so that the multiplicative function a(n) is the square-free kernel of n. It was shown 
by de Bruijn [5] (see also his monograph [4]) that 


log P(x) ~ V8(log x/ log log x)!/? (x > oo). 


The asymptotic formula for P(x) itself is harder! By using Ingham’s Tauberian 
theorem Schwarz obtained a true asymptotic formula for P(x). This is 


1 1 1/2 
PG) = (4nyvzrs( PEPE) Q(x), (3.2) 
logx 
where 
a Al 
Q(x) — Pre a anne 3 


One can put (3.2) into another form: as x > oo 


_ 1+ 0() (loglogx “e 
poy = EO( ioe ) exp(—R(logx)). 


where 


Ria) a 00/21 = “Ye Go), 
logu 


206 A. Ivié 


These problems involve a detailed study of the asymptotic behavior of the generat- 
ing function 


1 
:= Slog (1+ ——_ J, 
oo dX ox( apo) 


since 


F(s) = exp(g(s)),  F(s) = oo an = (Res > 0). 


n=1 


Several other results on a(n) are also proved, e.g., 


r= o(: raza ie 


nSx n&x 


In Part C [42] of his “Habilitationsschrift” Schwarz considers the so-called 
Mahler’s problem from the theory of partitions. Namely, let 


0<A, <dAd <-, y> 1 = Blog’ u+ O(), 


Ay<u 


where B > 0,0 < 6B < 1. For Res > 0 let 


Tt 1 — exp(— Ays)\ = exp($(s)), 


say. Further, let o,, denote the positive solution of —¢'(o,,) = u, and T(u) = $(o,)+ 
uo, and let P(u) denote the number of solutions of the inequality 


NyAy tnodAra te <u 


in non-negative integers n,. Schwarz proves that, as u — oo, 


P(u) ~ {27 B log’ uy? exp(T(u)). 


4 Distribution of Values of Finite Abelian Groups 


4.1 General 


W. Schwarz published the works [43] on a(n), the number of non-isomorphic 
abelian (commutative) groups of order n, plus the joint papers with Herzog [13] 
and Wirsing [58]. 
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The function a(n) is multiplicative and prime-independent, namely one has 
a(p*) = P(a), where P(a) is the number of unrestricted partitions of a € N, 
hence a(p) = 1, a(p”) = 2, a(p*) = 3, a(p*) = 5, etc. Let (p denotes primes) 


f(s) => on*=[[a-p ‘ty! (Res >), 
n=1 Pp 


be the familiar Riemann zeta-function, which possesses analytic continuation to C 
and is regular, except at s = | where it has a simple pole with residue 1. Then the 
generating series of a(n) can be written as 


co 


A(s) := a a(n)n * = €(s)C(2s)C(3s)... (Res > 1). 
n=1 
Therefore the Dirichlet series A(s) has poles at s = 1/1,1/2,1/3,..., so that one 


expects the summatory function A(x) := )~ a(n) to be well approximated by 
nx 


Sot ae [I] (5). 


j k=1,kAj J 


Indeed, W. Schwarz in Part II of [43] shows that 


6 
A(x) — }o gx'4 = o@'/e*) (4.1) 


j=l 


cannot hold for any given ¢ > 0. 


4.2 Results on A(x) 


If we set 


5 
A(x) = Soo? + A(x), 


j=l 


then the author [22] showed that 


»¢ 
/ A?(x) dx = 2(X* logX), 
1 
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where f(x) = 2(g(x)) means lim,-+o0f(x)/g(x) #4 0. This gives a slight 
sharpening of (4.1), namely 


A(x) = @ (x46 Viogx) 


In the other direction the author [21] showed that 
x 
/ A? (x) dx = O(X4 log? X) (a = 39/29 = 1.34482...), (4.2) 
1 


and Heath-Brown [11] obtained the mean square formula with a = 4/3 and b = 89. 
In view of the Omega-result for A(x) the exponent 4/3 in (4.2) is best possible. 


Open problems: is it true that 
xX 
i A? (x) dx = (C+ 0(1))X*3 (C > 0, X > cw)? 
1 


Can one evaluate the higher moments of A(x)? 


There is a rich history on upper bounds for A(x), starting from the seminal paper 
of Erdés and Szekeres [6] in 1935, who obtained A(x) = O(./x). Sharpening a 
method of Richert [37] who established 


A(x) = O(x7/! log?!” x), (4.3) 


W. Schwarz in Part I of [43] obtained (4.3) with the better exponent of x (stated by 
Schwarz as his “Satz 3” in this way!) 


3 7 
10 30-23 


H.-E. Richert actually conjectured that 
A(x) = o(x"4) (x > ov), (4.4) 


which is quite deep and does not seem to be within reach yet. All recent results on 
upper bounds for A(x) were obtained by estimating exponential sums connected 
with the error term in the asymptotic formula for )° 1 (see, e.g., Chap. 14 


ab? <x 


of [23]). 
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4.3 The Order of A(x) and a(n) 


The best known result on the upper bound for A(x), proved decades later in 2006, 
comes relatively close to (4.4). Namely, Robert and Sargos [38] proved that 


AG) = 040"); 
Here O, (and <, below) means that the implied O-constant depends only on ¢, 
which is an arbitrarily small, positive number. The crucial bound used in their proof, 
which is actually best possible, has an arithmetical flavor. It says that 


N <«, (N46 +N’) (6 > 0), 


where NV is the number of integer quadruples (m,n, k, 2) such that N < m,n,k,£ < 
2N and 


|J/m + J/n—Vk—V2| < VN. 
The maximal order of a(n) is considered in Schwarz and Wirsing in [58]. The 


minimal order of a() is trivial, since a(p) = 1 (Vp). Sharpening a result of E. 
Kratzel [29] they prove, as n > ov, 


1 
log a(n) < log5- li (J log) +O flogn . exp(—ci v log log n)} j 


where c, > 0 and 


Also, there are infinitely many values of n such that (cz > 0) 


1 
log a(n) = log5- li (; log) +0 {logn : exp(—e2 v log log n)} ; 


so that the upper and lower bound for a(n) are of the same form. Generalizations to 
multiplicative, prime-independent functions of these results have been obtained by 
Heppner [12], Nicolas [32, 33], and others. 
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5 Distribution of Values of the Omega Functions 


In [45], W. Schwarz investigated the so-called Rényi’s problem (Alfred Rényi, 
1921-1970, a well-known Hungarian mathematician after whom the Mathematical 
Institute in Budapest is named). Let K be the set of numbers n = p{''p5? ... p% such 
that a; +...+a,—r = 1, or equivalently 92(n) — w(n) = 1, where 


Qn= ya, on)= S01. 


p*||n pin 
Thus §2(n) is the number of all prime factors of n, multiplicities counted, while w(n) 


denotes the number of distinct prime factors of n. One also defines 2(1) = w(1) = 
0. Rényi’s problem consists of the estimation of the counting function 


K®= Yo 1s > ie 


nXSx,ne€K n&x,Q(n)—o(n)=1 


Rényi [36] in 1955 proved that 


6 1 
K(x) = 40x + 0G) («= Day soe). 


Cohen [1, 2] improved this to 
6 
K(x) = —ax+ O(x'/? log log x). 
sa 


W. Schwarz obtained, for any fixed r € N, 


r-1 


6 : 
K(x) = —ax+ at S> by log x + o( vt log’ *) : 
a logx = logx 


where the constants b;, are explicitly given. Dieter Wolke [63] in 1991 considered 
the more general problem of the estimation of 


Fy(x) := > 1, 


n&x,Q(n)—o(n)=q 
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where g € N is given (so that K(x) = F\(x)). His result improves the results of 
Schwarz on K(x). It says that for explicit d, > 0 (dj = 617), 


F(x) = dgx + 


Ce s P; 4 (log log x)(log x) 7! 


I<j<N(x) 


+0 fier exp(—b,N(x))} : 


Here N(x) = log!/? x(loglogx)~'/?. The polynomials Pjq(x) (of degree < g — 1) 
satisfy 


|P; (log log x)| < b3bi(j+ l)!glogx (b3 > 0, b4 > 0), 


which is an explicit bound. 


6 Schwarz on the History of Primes 


W. Schwarz has written many remarkable review papers, textbooks, and mono- 
graphs. For some of them, see [47—50, 53, 54, 56, 59]. I have particularly liked 
his textbook [47], “Einfiihrung in die Zahlentheorie.” It deals with many classical 
subjects of number theory, but Schwarz manages to present them in a novel and 
interesting way. His paper [50] on the history of primes is a real tour de force, 
analyzing superbly many topics connected with prime numbers. But let me refer to 
the review of the paper by D.R. Heath-Brown: 


MR1298643 (95h:11001) 


This is a comprehensive survey of the history of the prime number theorem and of 
developments from it. It makes inspiring reading, describing numerical investigations, a 
wide range of different proofs of the prime number theorem, and a whole range of other 
prime number theory, including primes in arithmetic progressions, sieve methods, and 
additive prime number theory. There are a number of photographs of researchers in the 
area, and occasional quotations from original papers, giving the article a down-to-earth 
feel. Overall this is an excellent read for any mathematician. 


7 Schwarz’s Influence on the Author 


W. Schwarz influenced me in more ways than one. We had many common interests 
in multiplicative number theory. I began corresponding with him in 1975, and the 
contact lasted even after he was seriously ill in 2012. The last e-mail message 
I received from him was a Christmas greeting in December 2012. Most of all, 
I remember his gentlemanly figure and his vast mathematical culture during the 
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many Oberwolfach meetings on “Elementary and Analytic Number Theory” at 
the Mathematisches Forschungsinstitut, which he organized for several decades 
together with H.-E. Richert and E. Wirsing. I owe very much to all three of them, 
especially to Schwarz, for being regularly invited there, as these meetings enabled 
me to meet most of the contemporary important research workers in analytic number 
theory, to mention just P. Erdés, Matti Jutila, and Y. Motohashi, with all of whom 
I have later collaborated extensively. I am taking the liberty to conclude this paper 
with two specific examples of W. Schwarz’s influence on me. 


7.1 Numbers of the Form pr’ 


Around 1975 I came to the paper of Schwarz [44] on the distribution of numbers 
of the form pr’, where p is prime, and r € N. I was a beginner in analytic number 
theory in those days. The proof of Schwarz’s result was not difficult, and the paper 
was written very clearly, so it immediately appealed to me. Let 


C= 1 


pr2Sx 


denote the number of integers of the form pr? not exceeding x. Schwarz used a nice 
elementary argument to show that, for a fixed integer k > 0, 


k 
C(x) = )° C,x(log xy"! + O (xogx)*”) , 


n=0 


where 
Co = 07/6 = (2), Cy = (—2)"6(2) + nCy-1 (n= 1). 


As soon as I read the paper, it occurred to me that Schwarz’s argument can be readily 
generalized to a somewhat more general situation. I wrote a short paper, which 
was published in the Polish periodical “Colloquium Mathematicum” (see [20]), the 
same journal in which Schwarz’s paper was published. It was, quite appropriately, 
reviewed by W. Schwarz himself, who in MR0498446 said verbatim: 

Using Dirichlet’s trick of splitting double sums carefully, the author proves a 
general theorem which gives an asymptotic formula for the number of integers n < x 
of the form n = a-b, where a € A,b € B, under the assumption that an asymptotic 


formula is given for the counting function B(x) = 1 and that an estimate 
b<x,beB 


A= SX) 1= 069 


asx,aeA 
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is known, where 0 < c < 1. As a special case this theorem contains the asymptotic 
formula 


k 
Y> 1 = 5 Cxlogxy "| + O (x(log xy) .” 


pr2Sx n=0 


It is fairly obvious that my work was not of a high quality, but Schwarz did not 
want to discourage me, so he did not express any explicit criticism. In fact, it was 
he who suggested that I submit the paper to “Colloquium Mathematicum,” and his 
advice was always sound and well-intended. 


7.2 Number-Theoretical Equations 


In [19] I solved a system of number-theoretic equations involving the number of 
divisors function d(n). I simultaneously published the problem in the “American 
Mathematical Monthly” as Problem 6108. This asked to find all multiplicative 
functions f(n) for which 


fo?) = Swe (5) 


d\n 


and 


fM=> fe). 


d\n 


W. Schwarz contacted me soon afterwards with some ideas on how to generalize 
this problem. This led to our joint paper [24]. 
Let 


(f x g)(n):= ) f(dgin/d), (fFlLam:= >> f(dgin/d), 


d\n d|n,(d,n/d)=1 


denote the Dirichlet convolution and unitary convolution of f(n) and g(n), respec- 
tively. These arithmetic operations were studied extensively in the 1960s and 1970s 
by many authors, e.g., see Eckford Cohen [1], [3]. 

Several number-theoretic equations involving Dirichlet and unitary convolution 
are discussed in [24]. One of the basic problems is the analysis of the equation 


fi =2° xf  (k=1,2,--°). eA 


21 
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If k = 1, the only non-negative solution of (7.1) satisfying f(1) = 1 is the 


divisor function f(n) = d(n). If k = 3 is odd [resp. k = 2 is even] there exists 
exactly one real-valued [resp. non-negative] function f satisfying (7.1) and f(1) = 


i. 
F( 


In both cases the solution f is multiplicative and prime-independent (meaning 
p*) = g(a), in other words f(p*) does not depend on a). The analogues for the 


unitary convolution are also discussed in the paper. 
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Abstract This article determines the order of magnitude of integers not exceeding 
x that can be written as sums of two squares of integers that are themselves sums of 
two squares. The tools include Selberg’s sieve and contour integration in the spirit 
of the Selberg-Delange method. 
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1 Introduction 


In about 1900 Landau calculated the cardinality of sums of two squares in a long 
interval [1, x]. With S = {n € N | n = a? + b*} he proved in [3] the asymptotic 
formula 


Xx XxX 
= (s+ °( ware): a 


nsx 
n€S 


1 —2)-1/2 
where C := a T],=3 1 -—p~) a 
It is possible to diversify the summation condition in a lot of ways. In this paper 
we want to consider the situation where each of the two squares is again a sum of 
two squares. 
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We define some notation. Let S = {n € N | pln > p ¥ 3(4)} C S, and let 
R(n) = Ha,beS|n=a+b*}, 
R(n) =#H{a,beS|n=a +b. 


Finally we write 


Six) => R(n)! 


nsx 


for i=0, 1,2 with the convention 0° =0, and analogously 


Six) = SOR. 


n<x 


Our aim is to estimate these quantities from above and below. We start with the 
following which can be obtained from (1.1) by partial summation. 


Theorem 1.1 For x > 3 we have the asymptotic formula 


o@) => [] a-p?y" 


x 
+0(——..). 
Ba logx (log x) 


_ An analogous asymptotic formula (with a different constant) can be proved for 
Si (x). : 
Our next aim is an upper bound for S2(x). As a preparation we define the function 


ys(n) =| [(1-p~). (1.2) 
pin 


Then we prove the following 
Theorem 1.2 Let gj, 1; € Z forj = 1,...,k with Q = [[q[] lair; — gril 4 0. 


J Fj 
Then 


#in<x|qn+r¢€SVj=1,...,B 


x T(k/2+ 2" 1 WeO. - xe 1 
< Goen'? FA) y*(Q) (: One + iQ) =) 


for a constant 5' > 0 and F*(1) given by the convergent Euler product 


k 


(1 -)" TT (0+ DO )a-ey" 
Pp P p=3(4) 1=2 
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This furnishes an upper bound for numbers such that k arithmetic progressions 
are not divisible by a prime congruent to 3 modulo 4. 

To prove this theorem we use Selberg’s sieve which requires a calculation that is 
reminiscent of the Selberg-Delange method. Having this bound available, we will 
prove the following theorem. 


Theorem 1.3 For x > 3 we have the upper bound 


Bw «KH. (1.3) 
log x 


We see that S,(x) and S(x) have the same order of magnitude, thus with the 
Cauchy-Schwarz inequality we get a good lower bound for So (x). Furthermore we 
have the inequality So(x) < So(x), so we get a lower bound for So(x). This gives us 
our final result. 


Theorem 1.4 For x > 3 we have 


1 
ee OO Sp Ole) 


withCo =4% [TT (l-p™’)t. 
p=3(4) 


In other words, the number of sums of two squares of sums of two squares is of 
order of magnitude —. 
sx 
2 Useful Lemmas 


First we will prove three lemmas which are useful to prove the theorems. 


Lemma 2.1 For fixedn,m € N we get 


vx m 

/ t “( Vx— 0? Jat J21 x (1+ 0( 1 )) 

y Jlogt” dt\ /log(x — 2)" logx//" 
Proof Let X = aoeere We consider the two integrals over [0, X] and [X, ./x] 


separately, and denote them by /,(x) and I,,(x), respectively. 
In the integrand of J,(x) the derivative is bounded by 


f(a ) Ay = 
dt\ flog —P)" Vx—X2/loge—X2) 0 Vx —X? 
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Using this and the estimate =r < X we obtain 


J logt 


Xx 
——— dt K — 


xX 
I. ———_—_——.. 
0«< | ae < F< aes 
0 


For the integrand of J,,(x) we use Taylor’s formula around t = ./x 


as, sa +0( t— /x ) 
Jlogt” Jlogx” JxJlogx”** 


For the integral over the error term we compute the derivative term and use trivial 
bounds for the factors 

Ji 
x 


— ft)? 1 
__ (ve ot ) _ log(x — 2 ie _ Ss <K a 
Vx— 2 Jlog@ — 2)" /x/log x Jlogx 


Pete : 2 
In the remaining integral for the main term (2 of Taylor’s formula we 
expand the lower bound to zero. With partial integration and the substitution y = 


x — f° we get an integral similar to the one above, whence 


Vx x 
d Vx — 2 _ Jy 
fae) * | rer? Ga) 


We need to compute the integral. Again we split it into two parts [0, Z] and [Z, x] 
with Z = TonayFT and repeat the estimates to get 


WA vy 1 
| Sar? Jlogx” Jo aol! + OG): 


dy can be computed exactly with the result = 


The remaining integral fied Wi 


and our claim follows. 


Lemma 2.2 The map 


{(k,1,m,n) € N* | kl = mn} <=> {(s,t,u,v) € N* | (s,u) = 1}, 


m — n(k,m) 
(hmm) > (7 a ”): omy" ; i} 


(st, uv, tu, sv) <4 (s,t, u,v) 


is a bijection. 
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The proof is straightforward. 
Lemma 2.3 Let k € N ands > 1/2. Then for x => 3 we have 


1 
> —@, *(n) < logx. 


nsx 


Proof The case s = | and k = 1 was proved by Landau (c.f. [2]). Note that ng, (n) 
is Euler’s phi EPBEHON, 


The expression a Hy can be rewritten as 
1 1\-4 1 (d) 
— = | 1-— = | 1+ = ) 
ys(n) pin ( p p\|n ( P* = i) d\n d*Qs(d) 


where jz is the Mobius function. We insert this in our summation to get 


Lae Esl aaa) 


nsx 

=e POG A > 2 
apex di ps(d1) +++ des (de) lem; (di) noms m 
jal re k = Tem; (dj) 


«y 1? (di) ++? (de) 
c= dips (d)) +++ di@s(dx) | lem; (dj) 


log x. 


We complete the remaining sum into an infinite sum and see 


— LL (dy) +++? (dy) 1 1 k 
a Festa Tem;(d) IT( ia) 


This product converges absolutely for s > 1/2. 


3 Numbers in S in Arithmetic Progressions 


In this section we prove Theorem 1.2. For simplification we assume k > 2, since in 
the case k = 1 we can apply the proof of (1.1) with a different constant. In addition, 
if (q;, r;) has a divisor which is congruent to 3 (mod 4) there are no solutions, hence 
the inequality is true. If (g;, r;) has a prime divisor p = 1 (mod 4), we can simply 
divide it out without changing the condition gjn + r; € S. Hence we assume without 
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loss of generality that (g;,7;) = 1. We use Selberg’s sieve, c.f. [1, Theorem 6.4, 
p. 10]. Let 


P=|[p and 


p=3(4) 
k<pxx 


O(t) = (qit +171)... (Get + rx). 


Then the cardinality of {n < x | (Q(n), P) = 1} is an upper bound for the quantity 
we want to bound. 

Consider the function p(d) = #{a (mod d) | Q(a) = 0 (mod d)}. We see that 
p(p) < min{k, p} for p | P. In particular it is p(p) = k if (p,Q) = 1. By the 
Chinese remainder theorem, p is multiplicative. Hence we get 


Yo 1 = Sold) + Ou). 


nsx 


d|Q(n) 


where t; is the k-th iterated divisor function i pegs 


We define the multiplicative function g by g(p) = = e ; 
g(p) = 0 for p + P and g(p”) = 0 for all p and all v > 2. For a parameter z to be 


chosen later let Z = )° g(d). Therefore by Selberg’s sieve 
dz 


for primes p | P, 


© 15F+0( Lan). G.1) 


d<2 
PIP 


nsx 
(Q(n),P)=1 
Hence we need to find a lower bound for Z to get the information we want. 
Writing 


Pio 
p=3(4) p=3(4) 


it is easy to see that Z > Z’. 

For the sum Z’ we use Perron’s formula to find a lower bound. It will be useful 
to use a refined version due to Liu and Ye [4]. Consider the Dirichlet series of Z’, 
given by the Euler product 


D(s): = I] (1 4 (1 + ey Il (1 4 oe) 


pea) past) 
p>k 


= Di(s)- (s+ IL ¥7(s + 1, x-«) (5), 
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where 
g(p) Tr-1(P) \-1 
D'(s) => I] (1 + pos + pl ) 
pio 
p=3(4) 
p>k 
The function H(s) is holomorphic and nonzero in Res > —>. The finite product 


D’(s) also is a holomorphic function and bounded by 


1 
es(Q) K |D'(s)| < = 
rae (Q) | (s)| Pri pe MQ)” 


where Q, is as in (1.2). Define F?(s + 1) = €(s + I)L7'(s + 1, y-4)H?/*(s)s, then 
F is holomorphic in the zero-free region of the L-function as in [6, II §5.4] and we 
get D(s) = F*(s + 1)s~*/?D"(s). 

Let 2 < T < z bea parameter which we choose later. For the error terms in 
Perron’s formula we need to bound the quantity 


1 
» gin) « » Ww (n)t%—1(n) I] — = 
k-als FR k-als a oe P a P 
k-2 
«KoQ) em ao) < g7t(Q foe 


JT 


k-als 


The last bound follows by [5, Theorem 2]. The second part of the error term is given 
by 


2B) Jal? C+ ce) — (logz)k 
JT ae rer ar ie 


where we choose b = 
By applying Peon" s foruiilas in the version of [4, Corollary 2.2] we get 
peti iT 
1 FK(s+1 _ log(z))*-2 log z)* 
ist i c+ one + 0(prt(q Se , tesa) 
gk/2 JT JT 
—-_ir 


(3.2) 


Now we integrate over the same contour as in Tenenbaum [6, II §5.4] but shifted to 
zero, So that the order of the singularity at 0 is f +1. 
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We split the contour into various pieces 


— be = r], 
ey [oF Dog@4+T) | 
= -——_ + tt, for t € [—T, 0], t € [0, T], 
Y2> Yo Dlog(2 + |A) [ ] [0, T] 
v4 = 0B,(0), 


=[-spar~] 
a Wy ane 


[-"-s5l 
=| —-r,-——— }, 
Ea 2log2 


with r = _ and 6 > 0 is chosen so small that 


1 
—— < log |t|, s) < log |r|, 
TD g|d| a g || 


foro >1— aE (see [6, p. 262]). 
For the parts yj with i = 1, 2,6,7 an upper bound for D’(s) is given by the 


function 9;- tO) with 6’ = We get for y; and analogously for y7 the bound 


aT 


1 
logz 
Fe(s +1 log T)* do 
i ( pie « SeaDt fl 2 
stl? Pin 5(Q) r 
Y1 me 
2log(2-FT) 


(3.3) 
— we re 7 = = 
K 9125 (Q)——— z do & g2,(Q)X——— 


For the parts yz and y¢ the bound is given by 


[Prot 


gk/2 
y2 


~ it|'at (3.4) 


<K o7F,(Q) (log T)*z 4/2 08D) Jig la ToT 


_ <psioan 2donT) 
< Qi 2k, (Q)z ae i eee Te 
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We choose T = exp(4 ./log z) to optimize the error term composed from (3.2), (3.3) 
and (3.4). 

By writing Y’ = y3yays for the contribution not yet considered we get with the 
bounds from (3.2), (3.3) and (3.4) the asymptotic formula 


, 1 FX(s + Wigg get —2k —k (log z)é 
= Sa —azP (s)z ral (2 5(Q) + g; “(Q)) —). 
Y’ 
(3.5) 


The remaining integral does not have an explicit solution, but by using the Taylor 
expansion 


F*(s + 1)D'(s) = F*(1)D'(0) + (vi) log Q- s\) 


we obtain 


1 F¥(s + 1) _ds 
ey pe a S| 
| gk/2 ()z s 


Y’ 


= brass fseS 5] + O(et@rvse | HeF) = n@ the. 


Y’ 


The product D’(0) is bounded from below by pk (Q). Now we change /;,(z) by 
the substitution s = =i into the form of Theorem 5.2 from [6, II §5.2]. Let Y be 


the resulting contour from Y’. Applying Corollary 2.1 of [6, Il §5.2], for Hankel’s 
contour we get 


1 ae 
tne) = PAH Q)(og2)*?|— ftw 4?+Yan) 
2. 


= F(I)g{(Q)(log2)"?( + O(<")), 


1 
DP(k/2 +1) 
for some v > 0. 

To compute the integral /,(z) we use trivial bounds on the circle and on the lines 
separately to get Ip < gj 7 *(Q) log O(log z)*/2-!. Combining the obtained results 
we get a lower bound for Z’ and hence for Z. The error terms from Perron’s formula 
and from the other pieces of the contour (3.5) combine with /,(x) to give 


(log z)* 


—k k/2— ve —2k —k 
e*(Q) loz QMlog 2)" + (C1 D+ 9") ETD 36 


<K (v7; *(Q) log Q + 974,(Q)) (log z)*/7". 
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The error terms in Eqs. (3.1) and (3.6) are optimized by choosing z = oo with 


(log x) 
k 
d= 4(4 + 1+ 3) and we get 
#in<x|qnt+nESVj=1,...,8 


-_ 2% D(k/2 + 1)2*/? ba o((SE2 +¢ 
~ (log x)*/? F*(1) gy} (Q) yi (Q) 


FH) Gx): 


4 Proof of Theorem 1.1 


In this section we find an asymptotic formula for S; (x). 
Let ro(n) be the characteristic function on the numbers that can be written as a 
sum of two squares. By rewriting the sum in terms of 7o(7) and (1.1) we get 


Sie) = D> ro(@C a 


14+ 0O((1 - 
a ——( + O((log(x — a))~ ): 


By applying partial summation and once more (1.1) we get the integrals 


Vx—# 

2 

Si@)=-C als Tata ia) 
Vx— a 1 


+ 20 ne =e) — (+ log(x — a2) 


vol t ( Vx — 0? )ar-+ a t d Vx— )a) 
3 dt = Jlogt dt 3 
Jlogt 4 \./log(x — t) ‘ ogtdt\ flog —P) 


=:M(x) + 0+ E(a). 


) 


. . . . J x—a2 2, 
The boundary expression vanishes since lim —~=“—(1 + (log(x—a 
y exp Ly Teo, (log(x—a’))“!) = 
With Lemma 2.1 at our disposal the computation of M(x) and E(x) is simple. 
For M(x) we have m = n = | and hence 


-_ x x 
2 
MG) = C2 — log x a O(aoenr): 
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The error term E(x) has two integrals, where in the first one we have m = 3, 
n = | and in the second one m = 1, n = 3. Since Lemma 2.1 is symmetric in n and 
veo . 
mit yields E(x) << Tosn? 
Combining these two terms we conclude that 


= 2 Xx Xx 
Sia) = CST + Coos (4.1) 


The asymptotic formula for S;(x) is almost the same as S;(x), and for x > 3 we get 


Xx 


logx 7 Oran): 


ia) = + [] a-p%) 


p=3(4) 


5 Proof of Theorem 1.3 


The upper bound of S> (x) requires a bit more work. Opening the square we have 


So(x) = ~ 1. 


a,b,c,deS 
a2 +b? =c? +a? <x 


We need to understand the summation condition a? + b? = c* + d?. The diagonal 
term is S| (x) which we have bound above. Hence we can assume that {a, b} 4 {c, d} 
and the condition a” + b* = c* + d? is equivalent to (a+ d)(a—d) = (c+b)(c—b). 
Using Lemma 2.2 this can be rewritten with s, t,u,v € N with (s, uw) = 1 as 


a+d=st, a-d=uv, ctb=tu, c—b=sv. 


Without loss of generality we can assume that a > b and c > d, then the condition 
a,b,c,d € S changes to st + uv, st — uv, ut + sv, ut — sv € S. It follows that 


Sw<s YooL 


st,uv,ut,sv<2 JX 
stkuvutsvEeS 


Now we want to apply Theorem 1.2. We fix u, t, v and think of s as our variable. In 
accordance with Theorem 1.2, we define Q = tu(t— v)(t + v)(t? + v’). 

In the case f = v our conditions imply that f(s + u) > 0 and t(u +s) > 0 which 
is obviously impossible. 

In the following let t 4 v, so that QO ¥ 0. Isolating the sum over s we get 


your ys 


STUD Ut,SV<2/% tu,uv<2./x s<min{ uf 2Sx 2Jey 
= hr 


stuvutesvEeS 


stkuvutsvEeS 
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We use Theorem 1.2 to calculate the inner sum, and obtain 


es log Q 


s<min{ 4 
stacuv.ut Ewes 


with 6’ defined in Sect. 3. For the sum over t, u, v we get the condition v < f, since 
“2 < “. Furthermore in the term 9} 4(Q) we split the product Q. To this end a 
short look at the definition shows that g,(nm) > @,(n)@,(m) and with the Holder 
inequality it follows that 


ut 1 1 1 1 1 1 4 
Py v (log #)2 Co giv) gi(t—v) Git + v) G(? + 5) 


uv<ut 


ut<2./x 
5.1) 
ut = 1 1 4/5 u r(n) \1/5 ( 
<(L mere) (La sa) 
wan, VU (log >)” gy (v) (log u)* gy (n) 
ut<2./X% ae 


where r(n) counts the number of solutions a? + b? = n. 
: ut)\—2 —2 ‘ 
First we compute the sum over v and use (log =) < (logu) ~~. Using 
Lemma 2.3 to compute the remaining sum over v we get 


2 LS v ed 


u 
« — ftlogt. 
ut<2.fz VSt P) a », (log u)? 


With partial integration we get for the sum over tlog ¢ the upper bound 


2 
> tlogt<« * (og 2% - 1). 
Uu u 


pce 


There remains the sum over u for which once again we use partial summation. We 
obtain 


1 2./x 1 
x u(log u)2 me 2 > logx 
u<2,/x 8 8 
Hence the first factor of (5.1) is computed. 
Using r(n) = ak x-4(d) the second factor of (5.1) can be transformed 


similarly as in Lemma 2.3 and it follows that 


r(n) x 
De ge (n) « a2: . 
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And the sum over u can be computed as above to get 


y Uu r(n) x 
Ae, (log)? gp(n) ~ log: 
nay 


Combining these two results we get 


ut 1 1 x 


——_—__. {— « : 
cme ¥ (log F) g{(Q) ~ logx 
ut<2./X 


log Q 
9} (Q) 


For the sums over and 9 ;_ tO) the argument is the same and it follows in 


the same way that 


ut 1 logQ ut 1 logt x 
y aaa ae ) — —____—_ < 
W)3 (8 uy3 8 
uv<ut<2,/x : (log 7) a (Q) uv<ut<2./% v (log =e ¢}(Q) log x 


and 


1 
2 = 978s (Q)~,4(Q) « 


= X 
Uv uty3 7A WA?” 
uv<ut<2./x : (log = (log x) 


Combining these two results we get the final bound 


E x 
S2(x) K —. 
logx 


6 Final Result 


Theorem 1.4 is now a consequence of the Cauchy-Schwarz inequality. We have 
~ ~ 1/2 ~ 2 : 
Si) = (DOR) (YOR?) = So@)"?S)"”. 

nsx nsx 


Thus 


e2 
Su(s) > Sots) = 2. 


By replacing $(x) with (1.3) and Sj (x) with (4.1) we get the lower bound stated in 
Theorem 1.4. 
On the other hand we have So(x) < S,(x), which yields the upper bound. 


230 R.U. Jakob 


Acknowledgements I am very grateful to Prof. Valentin Blomer for suggesting this topic to me 
and his constant advice and encouragement. 


References 


1. H. Iwaniec, E. Kowalski, Analytic Number Theory. American Mathematical Society Colloquium 
Publications, vol. 53 (American Mathematical Society, Providence, 2004) 

2. E. Landau, Uber die Zahlentheoretische Funktion y(n) und ihre Beziehung zum Goldbachschen 
Satz. Nachr. Ges. Wiss. Goettingen, Math.-Phys. KI. 1900, 177-186 (1900) 

3. E. Landau, Uber die Zerlegung der Zahlen in zwei Quadrate. Annali di Matematica 20(1), 1-28 
(1913) 

4. J. Liu, Y. Ye, Perron’s formula and the prime number theorem for automorphic L-functions. Pure 
Appl. Math. Q. 3(2), 481-497 (2007) 

5. P. Shiu, A Brun-Titschmarsh theorem for multiplicative functions. J. Reine Angew. Math. 313, 
161-170 (1980) 

6. G. Tenenbaum, Jntroduction to Analytic and Probabilistic Number Theory. Cambridge Studies 
in Advanced Mathematics (Cambridge University Press, Cambridge, 1995) 


The Joint Discrete Universality of Periodic 
Zeta-Functions 


Antanas Laurin¢ikas 
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Abstract In the paper, a joint discrete universality theorem on approximation of a 
pair of analytic functions by shifts of periodic zeta-functions and periodic Hurwitz 
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1 Introduction 


Let s = o + it be a complex variable, and a = {a,: m € N}andb = {b,: meé 
No = NU{0}} be two periodic sequences of complex numbers with minimal periods 
k € N and/ E€ N, respectively. The periodic zeta-function ¢(s; a) and periodic 
Hurwitz zeta-function ¢(s,q@;6) with parameter a, 0 < a < 1, are defined, for 
o > 1, by the series 


cer ie b 
sia) = —" and s,a;b) = a 
f(s; a) a f(s, a; b) dX aaa 
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Let €(s, a) denote the classical Hurwitz zeta-function which is for 0 > 1 given by 
ae 


f(s,a) = Dy rer 


m=0 


and can be analytically continued to the whole complex plane, except for a simple 
pole at the point s = 1 with residue 1. In view of the periodicity of the sequences a 
and b, we have that, foro > 1, 


&(s;a) = = 3 ang (s. =) 
1 


m= 


and 


I-1 
£(s,0;6) = . Deal (s. 7 - “) 


m=0 


Thus, by the above remark on the function ¢(s, a), these equalities give analytic 
continuation for the functions ¢(s;a) and ¢(s,a@;6) to the whole complex plane, 
except for a hypothetical simple pole at s = | with residues 


1 ( 
a=7))an and b= >) ibm, 


m=1 m=0 


respectively. If a = 0 or b = 0, then both functions €(s; a) and ¢(s, a; 6) are entire. 
Clearly, ¢(s;a) is a generalization of the Riemann zeta-function, and ¢(s, a; 6) 
generalizes the Hurwitz zeta-function. 

In [3], a joint universality theorem on the approximation of a pair of analytic 
functions by shifts ¢(s + it;a) and ¢(s + it,a;6) has been obtained. Let D = 
{s eC: 5 <o< Ve Denote by K the class of compact subsets of the strip D with 
connected complements, by H(K), K € K, the class of continuous functions on 
K which are analytic in the interior of K, and by Ho(K), K € K, the subclass of 
H1(K) of non-vanishing functions on K. We remind that a sequence of numbers a, is 
multiplicative if a; = 1 and aj, = aa, for all coprime integers m,n. Now we state 


the main result of [3]. 


Theorem 1.1 Suppose that the sequence a is multiplicative, and that the number a 
is transcendental. Let K,, Ky € K, fi (s) € Ho(Kj), and fo(s) € H(K2). Then, for 
everyé > 0, 


1 
lim inf —meas 4 t € [0,T]: sup |€(s + it; a) —fi(s)| <, 
Too T seKy 


sup |C(s + it, a; b) —fo(s)| < e} > 0. 
SEK 
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Here measA denotes the Lebesgue measure of a measurable set A C R. Theorem 1.1 
is a generalization of the Mishou theorem proved in [9] for the Riemann zeta 
and Hurwitz zeta-functions. Theorem 1.1 is the so-called continuous universality 
theorem because analytic functions are approximated by the continuous shifts 
f(s + it; a) and ¢(s + it,a; 6), t € R. The aim of this paper is to prove a discrete 
version of Theorem 1.1 where analytic functions are approximated by discrete shifts 
f(s + ikh; a) and €(s + ikh,a;6), or, more generally, by shifts €(s + ikh,; a) and 
€(s + ikhz, a; 6), k € No, with fixed positive numbers h, h,, and hp. 
Denote by ? the set of all prime numbers, and define the set 


L(P,a,h, 1) = {\(logp : p€ P), (log(m+ a): me No), =} : 
Theorem 1.2 Suppose that the set L(P,,a,h, 1) is linearly independent over the 


field of rational numbers Q, and that the sequence a is multiplicative. Let K,, Ky € 
K, fils) € Ho(K}), and f2(s) € H(K2). Then, for every ¢ > 0, 


1 
lim inf locksw: sup |¢(s + ikh; a) —fi(s)| < e, 
Noo 1 seKy 
sup |C(s + ikh, a; 6b) —fo(s)| < €> > 0. 
SEK) 
Now let 


L(P,a,hi,h2,27) = {(hi logp: p €P), (mlog(m+ a): me No), 7}. 


We note that this set consists of all elements of two sequences in addition with the 
number z. Then we have the following generalization of Theorem 1.2: 


Theorem 1.3 Suppose that the set L(P, a, h,, hz, 2) is linearly independent over 
Q, and that the sequence a is multiplicative. Let K,, Ky € K, f\(s) € Ho(K,), and 
f2(s) € H(K2). Then, for every ¢ > 0, 


lim inf 
Noo 


1 
phyOSkSN: sup |e(s + skh; 0) — Ail <e, 
sEKy 


sup |C(s + ikhz, a; 6) — fo(s)| < ef > 0. 


SEK, 


Obviously, for hy = hz = h Theorem 1.3 reduces to Theorem 1.2. 

By the Nesterenko theorem [11], the numbers wz and e” are algebraically 
independent over Q. Therefore, the set L(P, a, h1, hz, 7) witha = x! and rational 
hy, hz is linearly independent over Q. 
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Professor Wolfgang Schwarz was a mathematician with a broad outlook. Among 
other problems of probabilistic and analytic number theory, he was also interested 
in universality of zeta-functions, and this is confirmed by the paper [7]. 


2 Two Lemmas 


For the proof of Theorem 1.3, we will apply the probabilistic approach based on a 
limit theorem on weakly convergent probability measures in the space of analytic 
functions. In this section, we present two lemmas which play an important role in 
the proof of such a limit theorem. 

Lety ={seC: |s| = 1}, and 


Qi = I] Yp and 22 = I] Vins 


pEeP meNo 


where y, for all p € P and y,, = y for all m € No. By the classical Tikhonov 
theorem, with the product topology and pointwise multiplication, the tori (2; and 
$2, are compact topological abelian groups. Therefore, denoting by B(X) the Borel 
o-field of the space X, we observe that on ({2;, B({2;)) the probability Haar measure 
mj can be defined, 7 = 1,2. Denote by @;(p) the projection of w@; € 22; to 
the coordinate space y,, p € P, and by w2(m) the projection of w, € £22 to 
the coordinate space y,,, m € No. Then @)(p), p € P, and a2(m), m € No, 
are complex-valued random variables defined on the spaces ({2;, B(1), m7) and 
(22, B(822), m7), respectively. 

We define one more set by 2 = 2; X 2). Then Q is also a compact 
topological abelian group. This leads to the probability space (2, B(2), my) with 
the probability Haar measure my. We denote by w = (@ 1, @2) the elements of 2, 
where w; € §2;,j = 1,2.. 

Next, for A € B(S2), let 


oe 1 , 
On(a) = 4 {O<k<N: ((p*": peP), 
((m+ a)“ : me€No)) € A}. 


Lemma 2.1 Suppose that the set L(P,, a, h,, hz, 1) is linearly independent over Q. 
Then Qn converges weakly to the Haar measure my as N — oo. 


Proof We apply the Fourier transform method, and use similar arguments as in the 
proof of Lemma 7 in [3]; recalling the proof of this lemma, we observe that the 
Fourier transform gy(k,!), k = (ky: ky €Z, p€ P),L= (In: In € Z, me No) 
of the measure Qy is of the form 


ent. = f (TL oPt T] ofan } ax. 


pEP meNo 
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where only a finite number of integers k, and /,, are distinct from zero. Thus, by the 
definition of Oy, 


enlk, 1) = Tew =_2 5 |p —ikkphy I] (mm + ox) 


k=0 peP meNo 
Sen my exp 4 —ikh, )~ ky logp — ikhy Y > Im log(m +a) ¢ (2.1) 
pEP méNo 


where, as above, only a finite number of integers k, and 1, are distinct from zero. 
Obviously, 


gv(0,0) = 1. (2.2) 


Before to consider the case (k, 1) 4 (0,0), we observe that, in this case, 


exp 4 —ih ) > ky logp — ihr Y° Imlog(m +a) ¢ #1. (2.3) 


pEP méeNo 
Indeed, if inequality (2.3) is not true, then, for a certain m € Z, 
exp —ih, > kp logp — iho > li log(m 4 a) = erin 
pEP méNo 


Hence, 


hy Sky logp + 0) ye In log(m + a) = xl 


pEeP méeNo 


with / € Z. Since only a finite number of integers k, and J, are distinct from zero, 
this contradicts the linear independence of the set L(P, a, h;, ho, 1). Thus, (2.3) is 
true. Therefore, in the case (k, 1) 4 (0, 0), using the formula for the geometric series, 
we obtain 


Ie are (my Dd kplogpth, \- itoginta)| 
gw(k,D = = _ 


orr(1-o| -i{h Dd ky logpthy YS In lett) ) 
peP 


mENg 
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From this and (2.2), we find that 


_fl if &)D=@O0), 
wm sri D= 10 ie (k,D A (0,0). 


Therefore, a continuity theorem for probability measures on compact groups proves 
the lemma. 


The next lemma uses again the linear independence of the set L(P, a, hy, hz, 77) 
over Q, and is devoted to ergodicity of a certain transformation on the space 
(2, B(82), my). 

Define the element dg pj, 4. € §2 by the formula 


Gann = {(p"™" : pe P), (m+ ay™ : meNo)}, 
and let, for w € 2, 


Pohy sha (w) = Gah, shy ® - 


Since the Haar measure my is invariant with respect to translations by points from 
§2, we have that @u.,n,,n, is a measurable measure preserving transformation on the 
probability space (2, B({2), mz). We remind that a set A € B(S2) is called invariant 
with respect to the transformation @ ;, ;, if the sets A and @y n,n, (A) differ one from 
another at most by a set of zero my-measure, and that the transformation @g,n,,15 iS 
ergodic if the o-field of invariant sets consists of sets with my-measure 0 or 1. 


Lemma 2.2 Suppose that the set L(P,,a, h,, hy, 1) is linearly independent over Q. 
Then the transformation Qy,hy,hy 1S ergodic. 


Proof We know (see the proof of Lemma 11 of [3]) that characters y of the group 
92 are of the form 


xo) = [] o(») T] orm, 


peP méeNo 


where only a finite number of integers k, and /,, are distinct from zero. If y is 
non-trivial character of 2, then, from this and inequality (2.3), it follows that, for 


(k,D # (0,0), 


X(da.jn,.n) = exp —ihy D> ky logp — thy Y> Im log(m + a) ¢ # 1. (2.4) 


pEeP. m€ENo 
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Let A be an invariant set of the transformation @ n,,;.. Denote by J, the indicator 
function of A, and by f the Fourier transform of the function f. Then 


ina) = J xleosa(co)mn (deo) = (0. 0)40(0) 
since the measure my is invariant, and, for almost all w € 92, 
Ta (Ge,hy.t.®) = Ta(@). 
This and (2.4) imply that 
Ia) =0 (2.5) 
for every non-trivial character y of 92. 


Now let 7o be the trivial character of (2, i.e., ¥o(@) = 1. Suppose that I(x0) =a. 
Since 


1 if x=%, 
@)my(d@) = ; 
[xeon ) : os 


we have in view of (2.5) that, for every character y of the group £2, 


ioe iL (onan a. (2.6) 


It is well known that the function J4(@) is uniquely determined by its Fourier 
transform Ih(o). Therefore, we have from (2.6) that J4(@) = a for almost all 
@ € 92. However, since J4(@) is the indicator of A, a = 0 or 1. Hence, J4(w) = 0 for 
almost all w € 9, or I4(@) = 1 for almost all w € 92. This shows that my(A) = 0 
or my(A) = 1, ie., the transformation @y n,n, is ergodic. 


3 A Limit Theorem 


Denote by H(D) the space of functions which are analytic on D, equipped with the 
topology of uniform convergence on compacta, and, for A € B(H?(D)), define 


1 
Py(A) = Wait Sk SN: &(s + ikh, aa, 6) € Al, 


where h = (hy, hz) and 


O(s + ikh, a; a,b) = ((s + ikhy; a), (8 + kha, a; b)). 
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Moreover, on the probability space (2, B(2), my), define the H?(D)-valued ran- 
dom element ¢(s, a, @; a, 6) by the formula, see [3], 


O(s, a, w; a,b) = (f(s, @1; a), C(s, a, @2; b)), 


where 
CO 
Am@|(m) 
,@1;a) = )  ——— 
with 
orm) = |] of(p). 
prim 
ptt tm 
and 


€(s,@,@2;6) = » Drmto (mn) 


m=0 (m + ow) 


We note that the series for €(s,@,;a) and ¢(s,a,@2;6) converge uniformly on 
compact subsets of the strip D for almost all @, € 2; and wz € £2, respectively, 
and thus they define H(D)-valued random elements on the probability spaces 
(2;, B(22)), mz) and (22, B(22), mz), respectively. Moreover, in view of mul- 
tiplicativity of the sequence a, we have that for almost all @, € £2; ands € D, 


&(s,a@1;a) = I] (143 4 3 Ak} sect). 
pEeP 


Let Py be the distribution of random element ¢(s,a,@;a,6), ie., let Pe be a 
probability measure given by 7 


P(A) = my (« € 2: C(s,a,0;4,b) € A) , A€ B(H?(D)). 
Now we are ready to state a limit theorem for Py. Let 
= {g € H(D): g(s) £ 0 or g(s) = O}. 
Theorem 3.1 Suppose that the set L(P,a,h,, hz, 1) is linearly independent over 


Q, and that the sequence a is multiplicative. Then Py converges weakly to Pr; as 
N — o. Moreover, the support of P¢ is the set S x H(D). ~ 
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Proof The proof is similar to the one for Theorem 6 from [3], therefore, we do not 


give all details. 
For fixed o; > 5 and m,n € N, let 


v(m, n) = exp \- (=)"} , 


and form € No, ne N, 


v(m.) = exp | (=) | 
eS : ~\anta 


Define the functions 


a Amv(m, N) 
&n(s: tao = 
and 
= bmu(m, n, a) 
n ’ 36 = T apie hake 
C,(s, 01; b) dX anes 


Then it is known (see [2, 6]) that the latter series are absolutely convergent for 
o> 5. For brevity, let 


ols + ikh,a;a,6) = (£,(s + ikhy; a), €,(s + ikhz, a; b)) 


and 
1 
PynlA) = 4 {0 <k<N: £ (s+ ikh,aa,b) eal 
A € B(H?(D)). 


Similarly, for @ = (@,,@2) € 2, define 


oe) 


&n(s,1;a) = >> 


m=1 


Am® (m)v(m, n) 


ms 
and 


bn@2 (m)v (m, n, a) 


en(s, a, 2; b) = ~~ (m + ays 


m=0 
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the series also being absolutely convergent for 0 > 5. Let 
Cs + ikh, ot, w; a,b) = (Cn(s + ikhy, w1; 4), Ons + ikhg, ot, @2; 6), 


and 


1 
Pynw(A) = ee {o SkKEN: € (s+ ikh,a,w; 0,6) € al : 
A € B(H’(D)). 


Now, using Lemma 2.1, Theorem 5.1 of [1] and the invariance of the Haar measure 
my, we easily find that, on (H?(D), B(H?(D))), there exists a probability measure 
P,, such that the measures Py, and Py». both converge weakly to P,, as N > ov. 
The latter assertion is an analogue of Theorem 8 from [3]. 

The next step of the proof of Theorem 3.1 consists of approximation. For this 
purpose, the metric of H?(D) is substantial. It is well known that there exists a 
sequence of compact sets {K;: 1 € N} C D such that 


[e.) 
D= uy Ki, 


K; C Ki41 for! € N, and if K C D is a compact set, then K C K; for some /. For 
81,82 € H(D), define 


(ei. gs) = 2 Stace 168) = 8200) 
ae l=1 oe SUP;ex, |gi(s) = g0(s)| , 


Then p is a metric on H(D) which induces the topology of uniform convergence on 
compacta. Now if, for g, = (811, 812), 8, = (821, 822) € H?(D), 


(8+ 85) = may ley: 82). 


then p is a metric in H*(D) inducing its topology. 
We also remind the Gallagher lemma, Lemma 1.4 of [10], which allows to deduce 
discrete mean square estimates from those of continuous type. 


Lemma 3.2 Let Ty and T > 6 > 0 be real numbers, and let T be a finite set lying 
in the interval [Ty + os To +T—- $), Define 


Ns) = D> 1, 
teT 


|t—x|<8 
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and let S(x) be a complex-valued continuous function on [To,T + To] having a 
continuous derivative on (Ty, T + To). Then 


1 


T T T 2 
Sartaisor = 5 f iseoyrax + ( | IsePax | Is" Pas) , 


1€T To To To 


For n € N, let 


where J"(s) is the Euler gamma-function. Then an application of the Mellin formula 
1 b+ioco 
— I(s)ja “ds =e“, a, b>O0O, 
2ni b—ioo 
leads to the representation 
1 b+ioo dz 
iia = sf to+aMnoS, 
2ni b—ioo Zz 


Let K C D be a compact subset. Then the above representation for ¢,,(s; a) and an 
application of the residue theorem imply, for an arbitrary h, > 0, the estimate 


N 
1 
— > sup |C(s + ikhy; a) — €,(s + ikhy; a)| 
N 2 suplt : ; 
[o,@) 1 N 
«| ln(G + it)| | — Jo |E(o + it + ikhy + it; a)| } dt, (3.1) 
=e2 N i=0 


where G < 0, ; < o < 1 and t is bounded by a constant. Since, for a fixed o, 


7 <o < 1, the estimate [6] 


T 
i |€(o + it; a)|?dt = O(T) 
0 
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is valid, Lemma 3.2 implies the bound 


N 


1 
N 2 
1 1 
yD I$ + it + ikl + it;a)| < (F Yoo + it + ikhy + it: 0] 
k=0 k=0 

Nh, 


<— \(o + it + it: a)|"dt+ 
hy Jo 


Nie 


Nh, Nh, 
(/ clo + ir iesayPar f (0+ i+ ina) Pat) <K 14+ lot}. 
0 0 


Therefore, in view of (3.1), 


N 


lim lim su su S+ ikh,; a) — €,(s + ikh,; a)| = 0. 
Jim, i ag 2 150) — nls + iki; a)| 


From this, by the definition of the metric p, we find that 


N 


lim lim su S+ ikh,; a), O,(s + ikhy; a)) = 0. 
Jim, imsup ey LP 1:0), Gu(s + iki: a)) 


(3.2) 


In [5], Theorem 4.1, it was shown that for an arbitrary hy > 0 and a compact 
subset K C D, 


N 
jim lim sup es sup |€(s + ikhz, a; 6) — ¢,(s + ikh2, a; 6)| = 0 
00 n00 N+ 11> sex 
Thus, 
N 
lim lim aa em p (f(s + ikhy, a; 6), f(s + ikh, a; 6)) = 0. 
n—->0O ny-> +1 aml 


The latter equality together with (3.2) and the definition of the metric p shows that 


N 

1 
lim lim su + ikh, a: a,b), + ikh,a:a,6)) = 0. 3.3 
noo it Ma 1 A U pi ) [AG U pla )) ( ) 


Standard arguments show that for almost all @, € 221, 


gi 
/ |C(o + it, a; a)|"dt = O(T) 
0 
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with a fixedo, ; <o < 1. Therefore, by a reasoning as in the proof of (3.2) we find 
that, for almost all @,; € 2), 


N 


lim lim SUP re is p(C(s + ikhy, @1; 0), on(s + ikh,, @;.a)) = 0. (3.4) 
k=0 


N00 N-s99 


The linear independence over Q of the set L(P,a@,h,,h2,7) implies the linear 
independence of the set 


| (on +a) m € No), 7 


Thus, repeating the proof of Lemma 2.6 of [4], we have that for almost all 2 € Qo, 


lim lim sup 
noo N->co 


N 
; S> p(E(s + ikha, a, 2; 6), &n(s + ikhz, o, a2; 6)) = 0. 
k=0 


This together with (3.4), for almost all w € 2, implies the equality 


N 


0 (Ks + ikh, a, @; a, b).g (s + ikh,a,w;a, b)) 


k=0 


lim lim sup 
NC n5909 N+1 


where 
O(s + ikh, a, @;a,6) = (C(s + ikh), 1; a), 6 (8 + kha, a, w2; b)). 


For w € £2, we define one more probability measure by 


Pyw(A) = waitlosksn: G(s + ikh, a, @; 4, 6) eal, 
A € B(H’(D)). 


Then the weak convergence of Py, and Py,,, to the same measure P,, as 
N — ov, equalities (3.3) and (3.5), and Theorem 4.2 of [1] show that the 
measures Py and Py, both converge weakly to the same probability measure P 
on (H?(D), B(H?(D))) as N > ov. 

It remains to prove that P = Pr. For this, Lemma 2.2 is applied. On the space 
(2, B(Q), mz), define the random variable 6 by the formula 


1 if C(s, a, w; a,b) EA, 
0 otherwise, 


6(w) = 
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where A is a fixed continuity set of the measure P. Then, for the expectation E@, we 
have 


i0 = / Bdmy = P(A). (3.6) 
. is 


Lemma 2.2 together with the Birkhoff—Khintchine ergodicity theorem yields the 
equality 


N 
sim, eT LP Qin (@)) = EO. (3.7) 


On the other hand, by the definitions of @ and gyn, .,,, we have that 


N 


1 
WT 2 8 (Yes sta)) = Prl(A). (3.8) 
k=0 


Since Py, converges weakly to P, and A is a continuity set of P, it follows that 
lim Py(A) = P(A). 
N->oo 
However, (3.6)—-(3.8) imply that, for every continuity set A of P, 
P;(A) = P(A). 


Hence, P; = P. 

The limit measure P; does neither depend on hy; and hy nor on the arithmetic 
of a. Therefore, by Lemma 12 of [3], the support of P¢ is the set S x H(D). The 
theorem is proved. 


4 Proof of the Universality Theorem 


A proof of Theorem 1.3 is based on Theorem 3.1 and the Mergelyan theorem on the 
approximation of analytic functions by polynomials, and is quite standard. We state 
the Mergelyan theorem as the following lemma: 


Lemma 4.1 Let K C C be a compact set with connected complement, and let f(s) 
be a continuous function on K which is analytic in the interior of K. Then, for every 
€ > 0, there exists a polynomial p(s) such that 


sup | f(s) — p(s)| < «. 
seEK 
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The proof of this lemma can be found in [8] and [12]. 
We will also use an equivalent of the weak convergence of probability measures 
in terms of open sets. 


Lemma 4.2 Suppose that P,, n € N, and P are probability measures on (X, B(X)). 
Then P,, converges weakly to P as n — ov if, and only if, for every open set G C X, 


lim inf P,(G) > P(G). 
noo 


The proof of this lemma is given in [1], Theorem 2.1. 


Proof (Proof of Theorem 1.3) By Lemma 4.2, there exist polynomials p;(s) and 
P2(s) such that 


sup | fi(s) — | < 5. (4.1) 
se€K, 2 
and 
E 
sup | f2(s) — p2(s)| < 5. (4.2) 
SEK? 2 
Define the set 


G= 


pile & é 
(g1.82) € H°(D): sup |gi(s) — "| < =, sup |go(s) — p2(s)| < =? . 
sek, 2 s€K> 2 


Then we observe that G is an open set, and, in view of the second assertion 
of Theorem 3.1, (e?!), p(s)) is an element of the support of the measure P. 
Therefore, P;(G) > 0. Thus, by first assertion of Theorem 3.1 and Lemma 4.2, 7 


lim inf 
Noo 


1 
“#10 <k SN: C(s + ith, a, 03.4, 6) eal > P:(G) > 0. 


Therefore, the definition of G implies that 


lim inf 
Noo 


1 
#20 <k<N: sup [f(s + ikhy; a) — | < S, 
1 se€Ky 2 


€ 
sup |¢(s + ikha, a; 6) — po(s)| < =p > 0. 
s€K2 2 


Combining this with (4.1) and (4.2) proves the theorem. 
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Remembering Wolfgang Schwarz, His Life 
and Work 


Lutz G. Lucht 


Abstract The essay is the author’s very personal retrospective on the life and work 
of Wolfgang Schwarz. Besides biographical details of Schwarz’s private life and 
his professional career, it gives account of his work on multiplicative functions in 
collaboration with several colleagues. 
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On July 19, 2013, Wolfgang Schwarz passed away in his home in Kelkheim after a 
serious disease. He was an outstanding, highly prolific, scientifically influential and 
innovative number theorist of international renown, a good friend, and a masterful 
teacher. His far-ranging professional work in pure mathematics was devoted to the 
large family of mathematics students and researchers. In his papers, monographs, 
lectures, and talks, he liked to share ideas and discoveries in mathematics with 
experts as well as with a broad audience. His passing is a severe loss to the field 
of mathematics in general, and to the theory of arithmetic functions in particular, 
and his loss will be deeply felt. 

Wolfgang is survived by his wife Doris and daughters Karin and Eva with their 
families including six grandchildren, to whom he was a devoted husband, father, 
and grandfather. He and Doris were married for more than five decades and endured 
some tremendous hardships, but never lost their courage to face life. BothWolfgang 
and his wife, who quickly became a real actor in the mathematical world, supported 
numerous mathematicians by invitations, great hospitality, and warmth in their home 
in Kelkheim. Those of us who were their guests will treasure these happy memories. 
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1 Early Years 


Wolfgang Schwarz was born on April 21, 1934, in Selb, a small town located in the 
Fichtel mountains, famous for the Rosenthal porcelain manufacturer, at the North- 
East of Bavaria near the borders to Thuringia and Saxony. There he developed a 
sportive passion for hiking, mountaineering, and skiing already in young years. 

At age 10, he lost his father in WW II, a bitter fate that he shared with 
many children during those war times. Due to his talents, diligence, versatility, 
and sportsmanship Schwarz mastered the Gymnasium with excellence during the 
difficult postwar period. He obtained the Abitur at age 17. In addition to his sporting 
activities he emerged as a very good chess and piano player. 

Supported by a Hundshammer scholarship he studied mathematics and physics 
for a teaching certificate at the University of Erlangen-Niirnberg from 1951 until 
1956. His thesis on transcendental p-adic numbers was supervised by Theodor 
Schneider. Schneider, a student of Carl Ludwig Siegel, is well known for his 
solution of the 7th Hilbert problem—independently from Gelfond—in his 1934 
Frankfurt dissertation. 

Until 1960 Schwarz was Schneider’s assistant and obtained his doctorate in 1959 
with a dissertation on additive prime power representations of positive integers. His 
subsequent time at the university of Freiburg certainly was the happiest period in 
Wolfgang Schwarz’s life, as he enjoyed the loyal atmosphere and high standards of 
the department, and the good libraries in Freiburg and the nearby Oberwolfach; he 
made life-long friends and, in particular, he met his wife Doris. 

Although mathematics took most of his time, he kept up a multitude of activities 
such as playing chess on a high level, mountaineering, and climbing, even when he 
was short on time. This is shown by the photos (Figs. | and 2). 
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Fig. 1 Bavarian Chess Congress, Bad Kissingen, 1954. Published with kind permission of © Doris 
Schwarz, 2013 


Fig. 2. On the /eft: mountaineering, +1955; on the right: climbing in the Freiburg Library, ~ 1960. 
Published with kind permission of © Doris Schwarz, 2013 


2 Survey of Schwarz’s Career and Work 


When Schneider was appointed to a full professorship at the University of Freiburg 
in 1960, he took Schwarz as postdoctoral research fellow with him to Freiburg. 
There Schwarz obtained the habilitation in 1964 with a thesis on Tauberian 
theorems, received a position as lecturer and, in 1969, as professor. 

In the same year he was appointed to a full professorship at the University of 
Frankfurt/Main, where he reestablished an internationally respected number theory 
research group. 
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The impressive list of Wolfgang Schwarz’s scientific ancestors dates back to the 
early eighteenth century and contains some of the most famous mathematicians of 
the modern age: 


Christian August Hansen 


Abraham Gotthelf Kostner 1739 Leipzig 

Johann Friedrich Pfaff 1788 Helmstedt 

Carl Friedrich Gaui 1799 Helmstedt 
Christoph Gudermann 1841 Gottingen 

Karl Theodor Wilhelm Weierstra 1854 Konigsberg 

Georg Ferdinand Frobenius 1870 Berlin 

Edmund Landau 1899 Berlin 

Carl Ludwig Siegel 1920 Gottingen 
Theodor Schneider 1934 Frankfurt am Main 
Wolfgang Karl Schwarz 1959 Erlangen-Niirnberg 


In Wolfgang Schwarz’s early years, sieve methods, Tauberian theorems, and the 
boundary behavior of power series dominated his research. Later he focused on 
Banach algebras, multiplicative semigroups, and inversion of arithmetic functions, 
the approximation of arithmetic functions by exponential functions, structural 
properties of algebras of almost-even arithmetic functions, their mean behavior and 
Fourier-Ramanujan expansions, classes of non-multiplicative, non-additive, iterated 
or shifted arithmetic functions and their properties. 

He published more than 120 papers and monographs and was a very successful 
academic teacher guiding fourteen doctoral students. The topics of their disserta- 
tions reflect the breadth of his mathematical interests. 


Karl-Heinz Indlekofer 1970 Freiburg 
Johannes Duttlinger 1972 Frankfurt 


Ernst Heppner 1974 Frankfurt 
Evelyn Brinitzer 1977 Frankfurt 
Manfred Gottschalk 1977 Frankfurt 
Bruno Tenderra 1977 Frankfurt 
Thomas Maxsein 1985 Frankfurt 
Joachim Herzog 1987 Frankfurt 
Peter Kunth 1988 Frankfurt 
Kurt Hohne 1991 Frankfurt 
Rainer Tschiersch 1995 Frankfurt 
Peter Bauer 1997 Frankfurt 


Christian Elsholtz 1998 Darmstadt 
Matthias Frost 2008 Frankfurt 
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Supported by Wolfgang Schwarz, Karl-Heinz Indlekofer, Bohuslav Divié, Ernst 
Heppner, and Jorn Steuding obtained their habilitation at Frankfurt University. 

Three of them, Karl-Heinz Indlekofer in Paderborn, Jorn Steuding in Wiirzburg 
and Christian Elsholtz in Graz, have followed in his footsteps as university teachers 
and researchers. Numerous students, assistants, and colleagues were fascinated by 
his courses, lectures, papers, and monographs, his versatile insight, his generous 
helpfulness, and his interest in imparting his rich knowledge. They were stimulated 
to enhance his methods and results and to contribute to the solution of resulting 
mathematical problems, and will always remember him with gratitude. 

Wolfgang Schwarz’s monographs on prime number theory [20], sieve methods 
(25], elementary number theory [26], and arithmetical functions [32], the latter 
jointly written with Jiirgen Spilker (Freiburg), resulted from his research interests 
and preparations of courses given in Freiburg and Frankfurt. They contain a large 
number of footnotes with valuable hints and thoroughly researched bibliographical 
remarks, a specialty of Schwarz’s publications. In particular, the monograph 
Arithmetical Functions has established itself as standard reference in this area of 
number theory, to which both the authors and their students have contributed many 
significant results. 

In the recent past Wolfgang Schwarz was also engaged in research in the history 
of mathematics. An important aspect was the Mathematical Seminar of Frankfurt 
University, the fate of mathematicians in Frankfurt after 1930 (see the remarkable 
Festschrift [30]), and the history of number theory (Fig. 3). 

In addition to many official duties in Frankfurt, Wolfgang Schwarz was 
active as 


¢ Longtime member of the Mathematical Seminar of Frankfurt University 

¢ Longtime member of the Wissenschaftliche Gesellschaft at Frankfurt University 

¢ Longtime trustee of the Catholic Church’s Cusanuswerk at Frankfurt University 

¢ Co-organizer (with H.-E. Richert and E. Wirsing) of the MFI Oberwolfach 
conferences on Elementary and Analytical Number Theory from 1974 to 1994 

¢ President of the Deutsche Mathematiker-Vereinigung from 1986 to 1987 


Fig. 3 Inaugural address at 
the Annual Convention of the 
DMV, Berlin, 1987. 
Published with kind 
permission of © Doris 
Schwarz, 2013 
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¢ Longtime member of the DMV executive board 

¢ Spokesperson of the Konferenz der Mathematischen Fachbereiche from 1992 to 
1993 

¢ Organizer of the 3rd ELAZ-Conference in Mainz and the preceding Number 
Theory Day in Frankfurt in 2004 

¢ Author, editor, and translator of more than 120 research papers and several books 

¢ Co-editor (with P.T. Bateman, L. Mirsky, H.L. Montgomery, W. Schaal, I.J. 
Schoenberg, H. Wefelscheid) of Edmund Landau’s Collected Works in nine 
volumes, 1985-1987 

e External supervisor and reviewer of numerous doctoral and habilitation theses, 
and papers 


Wolfgang Schwarz mastered a tremendous workload which he sometimes put on 
hold in favor of his family, of recreation, of joint ventures with friends, or simply of 
leisure activities (Figs. 4 and 5). 

Sometimes joint ventures with friends also consisted of walks in the hills 
depending on adequate weather. Unseasonal conditions could then lead to a joint 
mathematical paper. For instance, the first footnote in a joint paper with Spilker 
[31] from 1983 might have originated in bad weather during a joint hiking tour in 
Austria. (It says: The draft of this note resulted from joint mountain hikes in Rdtikon 
and in Otztal.) In 1982, I received a copy of the preprint which I picked from my 
collection of Schwarz’s papers: 


EINE BEMERKUNG ZUR CHARAKTERISIERUNG DER FAST-PERIODISCHEN 
MULTIPLIKATIVEN ZAHLENTHEORETISCHEN FUNKTIONEN MIT VON NULL 
VERSCHIEDENEM MITTELWERT!) 


Wolfgang Schwarz 
Jirgen Spilker 


ABSTRACT. 


Denote by «@ the vector space of BY-almost-periodic arithmetical 
functions. H. Daboussi [2], [3] and P.D.T.A. Elliott [8], [10] 
characterized the set of multiplicative functions in o&% with 
mean-value M(f) #0 by a statement concerning the convergence of 
certain series extended over the values £ (p*) of £ at prime powers 
(see (1.21), (1.22), and (1.23)). The authors give a simpler 
proof for the implication: If fe. /4F (q>1) is multiplicative 
with mean-value M(f) #0, then the series mentioned above are 
convergent. The proof consists in reducing the assertion to 

the special case q=2, for which a simple proof is available by 
Daboussi - Delange [3] (see also [24]). 


AMS classification: 10H45, 10K35. 


1) =m Entwurf entstand diese Note auf gemeinsamen Bergtouren im 
Ratikon und im Otztal. 
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Fig. 4 Making music with Peter Elliott, Oberwolfach, +1994. Published with kind permission of 
© Doris Schwarz, 2013 


Fig. 5 Skiing in front of the K6nigsspitze in Sulden, South Tirol, +1996. Published with kind 
permission of © Doris Schwarz, 2013 


Despite receiving notable enquiries from other universities, Wolfgang Schwarz 
remained in Frankfurt. After becoming an emeritus in 2002, he continued research, 
published papers, attended conferences, gave lectures, and supported many young 
academics until 2012. 
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3 Reminiscences 


I met Wolfgang Schwarz for the first time in the winter semester 1970/71. He 
was invited to give a colloquium talk in Clausthal which focused on the algebra 
of arithmetic functions and the subgroup of multiplicative functions under the 
Dirichlet convolution together with many applications, a topic that influenced my 
later teaching and research. 

In the same semester Eduard Wirsing was also invited, and I could easily guess 
that both had been chosen by the faculty as external reviewers of my impending 
habilitation with a thesis on the asymptotic behavior of sums of multiplicative 
functions, closing a gap between results of Wirsing and those of Schwarz. 

A few days after Schwarz’s talk, I received a small parcel with offprints. This 
turned into a regular correspondence and the exchange of preprints of many papers, 
and I was frequently invited to attend the Oberwolfach conferences on Elementary 
and Analytic Number Theory. 

When I was asked to give a seminar talk at Frankfurt University in June, 
1974, Schwarz had reserved the department’s guest room in the mathematics tower 
building. After my arrival and a brief check-in he took me to his and his family’s 
newly built Taunus home where Mrs. Schwarz prepared a dinner for all members 
and guests of his institute. The evening with about 12 persons was a stunning event, 
not only because of the excellent dinner (in particular, I remember the homemade 
Frankfurt Green Sauce, a local specialty), but also because of the hospitality and 
warmth, combined with good conversation and human attention. I immediately 
realized the sharp contrast to the atmosphere of the ugly tower building and its dirty 
surroundings. Let me recall that during the second half of the 1970s, Frankfurt was 
marked by fierce student’s protests, the APO (a radical student organization), and 
the emerging terrorist RAF. 

A few weeks after my Frankfurt talk I obtained a letter from Wolfgang Schwarz 
which contained a draft of an extension of the result presented in my talk; this led 
to a joint paper [18]. From the late 1980s on the correspondence was eased by the 
internet, with emails and file transfers replacing letters. 

In the 1990s the MFI Oberwolfach had to change its policy which resulted 
in limitations of the number of participants at conferences, and thus of younger 
academic visitors. When I first asked Wolfgang Schwarz about the prospect of 
starting separate conferences on elementary and analytic number theory particularly 
for young academics, he and several other colleagues convinced me to organize a 
low budget conference in Goslar. On his recommendation I was able to hire his 
postdoc Christian Elsholtz as assistant. In 2000, he and some student assistants 
helped to host that first ELAZ conference in Goslar, with 26 registered participants 
from five countries. In 2012, Schwarz and I were very pleased that the ELAZ 
conferences had developed so well, for the number of participants of the ELAZ 
conference organized by Jérn Steuding had risen to 71 participants from 27 
countries. 
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Until 2012 Wolfgang Schwarz was full of health and strength, staying true to his 
high ideals, and following his example kept me going. I am deeply indebted to him 
for his steady support during more than 40 years. 

The subsequent sections contain a survey of Schwarz’s work on three interrelated 
topics belonging to the theory of arithmetic functions. They are chosen according 
to my own research interests and have been initiated and developed further by 
Wolfgang Schwarz, his students, and colleagues. 


4 Inversion Theorems for Multiplicative Functions 


The 1960s and 1970s have been important decades for the development of the theory 
of arithmetic functions, in particular, that of multiplicative and additive functions. 
In 1967, Wirsing [38] solved the Erd6s—Wintner problem and proved that every real 
multiplicative function of modulus <1 has a mean-value, a result that is equivalent 
to the prime number theorem. In 1968, Haldsz [6] solved Wirsing’s problem by 
determining the asymptotic behavior of the partial sums of the values of complex 
multiplicative functions of modulus <1 at arguments n < x. What these papers have 
in common is a simplification of the problem by replacing the original multiplicative 
function f with a simpler related one, coinciding with f on the set of sufficiently 
large primes only and neglecting the values at higher prime powers. In fact, there 
is a previous paper of Delange [2] from 1961, which introduces the notion of 
related multiplicative functions and gives some applications. In 1983, Indlekofer 
[10] proved a general mean value theorem for uniformly summable multiplicative 
functions. 

In the 1970s, Schwarz considered this strategy under structural aspects and 
began to develop methods for transferring mean properties between complex valued 
multiplicative functions f and g. Let .@ denote the class of multiplicative functions. 

From a more general point of view, Schwarz’s approach may be described as 
follows: The deviation between arithmetic functions f, g: N > C with g(1) 4 Ocan 
be considered small if the Dirichlet series h(s) of h (implicitly) defined by f = g*h 
converges absolutely at s = 1. This is an £'-norm condition on h, which for f, g € 
M is equivalent to 


Ye lap) p* < 00. 


pk=1 


Here and in the sequel, the letter p always denotes primes. After a thorough 
analysis of Delange’s result, Schwarz [21] uses the Wiener inversion theorem for 
power series for an elegant proof of an extension of Delange’s result. Let .Y be 
Schwarz’s class of multiplicative functions f bounded in mean, and satisfying the 
local conditions f?(p) « p!~>, f(p") « c" for r > 2 with some constants 6 > 0 
and c, 1 < c < 2. Denote by J the class of arithmetic functions h: N — C with 
absolutely convergent Dirichlet series h(1). 
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Theorem (Schwarz [21]) Let f, g € .7 satisfy 


Y>If(p) — g(p)|p! < 00. 
P 


If no Euler product factor g,(s) = 1+ g(p)p-* + g(p*)p ~ +--+ of 8(s), p prime, 
has a zero in the closed half plane Res > 1, thenh := f * g~! € GY. In particular, 


the existence of the mean value M(g) = lim (a> » g(n)) implies that of M(f) = 
X00 n<x 


h(1) - M(g). 


This theorem has some important applications, but also some undesirable 
restrictions concerning the above local conditions of Wirsing type (see Wirsing [37]) 
and, especially, the shortness of the class .7. In fact, the local conditions on f and 
g can be replaced with mean conditions extending .” further, which yields some 
mean value applications (see [13]). 


5 Collaborations with Spilker and Heppner 


Due to the general importance of Wiener type inversion, the proof of the original 
Wiener inversion theorem [36] for Fourier series and power series was simplified by 
Gelfand’s theory of commutative Banach algebras, extended to ordinary Dirichlet 
series by Hewitt and Williamson [8] and, moreover, to general Dirichlet series by 
Edwards [3]. 

When Spilker and Schwarz [33] began to inspect Gelfand’s theory in the context 
of arithmetic functions, they found even shorter proofs of inversion theorems in 
commutative Banach spaces of arithmetic functions corresponding to generating 
power series, Fourier series, and Dirichlet series. In particular, their new approach 
to the inversion theorem for Dirichlet series, based on the Kronecker approximation 
theorem and the maximum principle for holomorphic functions, produced a density 
result interesting in itself. 

Recall that, under pointwise addition and the Dirichlet convolution * , the class 
A of arithmetic functions a: N > C with finite £'-norm |la|| := S°°, |a(n)| < 00 
is a Banach algebra with unit e, e(1) = 1 and e(n) = 0 forn > 1. The spectrum 
o(a) of a € Ais the set of all A € C such that a — Xe is not invertible in A. Denote 
by H := {s ¢e C: Res > 0} C C the open right half plane. 
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Theorem (Schwarz-—Spilker [33]) For every a € A, a(H) is dense in o(a). 


As an application, Heppner and Schwarz [7] proved an inversion theorem for 
multiplicative functions or, equivalently, for Dirichlet series with multiplicative 
coefficient functions, which correspond to Euler products. 


Theorem (Heppner-Schwarz [7]) The class 


= {3 €.M: 5° |\9°(p)|p? <coand Y° |g(p)|p* < oo} 
P 


p.k>=2 
is a unital semigroup under the Dirichlet convolution * with the multiplicative group 
a= {g€ GY:0 ¢ &p(1 + Hl) for all p}. 


The class Y extends the Schwarz class .Y considerably by replacing the £'-norm 
on the image set g(P) of the set P of primes with the £?-norm. Thus the above 
theorem of Heppner and Schwarz is a remarkable structural result. In particular, the 
rigid restrictions in Schwarz’s relationship theorem from 1973 do not appear any 
longer. 


Corollary (Heppner-Schwarz [7]) Let f ¢ Y, g € Y* satisfy )7,|f(p) — 
g(p)|p—! < co and let h be defined by f = g * h. Then h(1) converges absolutely. 


In a survey paper [29, Problem 8.6] from 1985, and at conferences, Schwarz 
posed the problem of finding quantitative versions of this relationship theorem. He 
was very pleased to see the solution [14] and its arithmetic applications based on 
the investigation of suitable weight functions and Banach algebras of arithmetic 
functions with weighted norms. 

It is worth mentioning that the proof of the above density theorem of Schwarz 
and Spilker from 1979 can be simplified by using the following maximum modulus 
principle for holomorphic functions of several variables, which seems to be little- 
known. 


Lemma (Gléckner-Lucht [5]) Let G = G, x --- x Gy _with nonempty bounded 
regions Gj,...,Gx C C. If f:G — C is continuous on G and holomorphic on G, 
then | f| takes its maximum value on 0 G x--- x 0 Gx. 


Its proof by induction uses the maximum modulus principle for holomorphic 
functions (of one variable) and the Weierstrafi convergence theorem. The paper [5] 
contains a new proof of Edwards’s inversion theorem [3] for general Dirichlet series 
in terms of complex valued functions, defined on additive arithmetic semigroups, 
and includes applications to Banach algebras of multidimensional Dirichlet series 
associated with arithmetic functions. 
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6 Ramanujan Expansions 


In 1918, Ramanujan [19] introduced the sums of the ath powers of the nth roots of 
unity, 


Cn(a) = ) ern (a,n EN), 
l<v<n 
ged (v,n)=1 


later called Ramanujan sums, and gave numerous examples of pointwise convergent 
series expansions of arithmetic functions f: N > C of the form 


f{@= > feta) (@eN) 


n=1 


with suitable so-called Ramanujan coefficients fi (n). Such representations are called 
Ramanujan expansions of f. 
Well-known properties of the Ramanujan sums are: 


(a) for any n € N fixed, cy(@) = Coca (na) (@) is an n-even function of a, 
(b) foralln,ae€N,c,(a)= Yo dyu(4) € Rholds, 

d|gcd (n,a) 
(c) for any a € N fixed, c,(a) is a multiplicative function of n. 


Among Ramanujan’s examples of expansions we find the evaluations 


0= > a and —1l= > a2 logn (aéN), 


n=1 i n=1 * 
which both are equivalent to the prime number theorem, in the latter case with 
nontrivial remainder term. In particular, the first example shows that Ramanujan 
expansions are not uniquely determined. 

For a long time, Ramanujan expansions remained a mysterious topic in number 
theory. In 1932, Carmichael [1] suggested to find a Fourier analytic approach. 
In 1981, Hildebrand [9] showed by an inductive construction that every arith- 
metic function has a Ramanujan expansion. However, Hildebrand’s expansions 
appear strange and have nothing in common with the “natural” ones displayed by 
Ramanujan. 

In 1975, Knopfmacher [12, Chap.7] presented a survey containing a general 
mean-value based concept for convergence of the above Ramanujan series in 
quadratic mean and, moreover, suggested to develop a new concept, which pre- 
serves and explains known expansions that do not fit in the mean-value based 
concept. 
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The problem of finding a general mean-value based approach to pointwise 
convergence remained open until Schwarz, encouraged by the progress of mean- 
value theorems for multiplicative functions, began to develop such a concept in 
1972. Ina series of papers [22—24, 27, 28] he considered the Ramanujan coefficients 


n » _ M(gcn) 
g(n) = 50) 


of g € .@, where ¢ is the Euler function, and proceeded in three steps, namely 


1. determine suitable classes of arithmetic functions such that the above mean- 
values exist, 

2. evaluate the Ramanujan coefficients, 

3. prove pointwise convergence of the Ramanujan series to the “correct” value. 


With technically intricate proofs he succeeded to verify the following 


Theorem (Schwarz [26]) Let g € -@ be bounded by 1. If M(g) # 0 exists then 
the Ramanujan coefficients g(n) exist for all n € N, and the respective Ramanujan 
series converges pointwise to g. 


In 1980, Tuttas [34] extended Schwarz’s result to the class &) of multiplicative 
functions characterized by Elliott [4]. A further extension to the generalized Elliott 
class &4 with q > 1 follows from papers of Indlekofer [11] and Warlimont [35]. 

Schwarz’s approach to pointwise convergent Ramanujan expansions is restricted 
to multiplicative functions having a mean-value and satisfying certain approxima- 
tion conditions used by Schwarz and Spilker in [32] to define and explore suitable 
normed algebras of almost-even arithmetic functions. It does not explain the known 
Ramanujan expansions of, for instance, the divisor function d(n) or the counting 
function r(n) of the number of points (a,b) € Z? ina circle of radius ./n. 

In 1993, a new concept was established in [14] (see also [15] and [17]), which 
solved both problems to find a quantitative “weighted” version of the Heppner- 
Schwarz relationship theorem and a new concept of Ramanujan expansions. The 
latter is based on a weighted relationship between the Ramanujan sums and the 
Mobius function (see the survey paper [16]), and explains and preserves the 
previously “mysterious” Ramanujan expansions. Schwarz was very surprised and 
delighted. He said, if he could change the monograph Arithmetical Functions he 
would rewrite several chapters. 

All of us who knew and worked with Wolfgang Schwarz are deeply indebted to 
him for his steady support, his friendship, and his ideas, which he kindly shared 
with us. 

At Oberwolfach conferences on elementary and analytical number theory Paul 
Erdés loved to talk about God’s Book containing the most beautiful mathematical 
theorems and their ultimate proofs. Let us think of Wolfgang Schwarz now happily 
studying The Book with all the wonderful results, to which he has contributed so 
much during his time on earth. 
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Dedicated to the memory of Professor Wolfgang Schwarz 


Abstract We give a survey on classical and recent applications of dynamical 
systems to number theoretic problems. In particular, we focus on normal numbers, 
also including computational aspects. The main result is a sufficient condition for 
establishing multidimensional van der Corput sets. This condition is applied to 
various examples. 
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1 Dynamical Systems in Number Theory 


In the last decades dynamical systems became very important for the development 
of modern number theory. The present paper focuses on Furstenberg’s refinements 
of Poincaré’s recurrence theorem and applications of these ideas to Diophantine 
problems. 

A (measure-theoretic) dynamical system is formally given as a quadruple 
(X, 8, u,T), where (X, 8, 2) is a probability space with o-algebra 8 of mea- 
surable sets and jz a probability measure; T:X — X is a measure-preserving 
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transformation on this space, ie. u(T~'A) = (A) for all measurable sets 
A € SS. In the theory of dynamical systems, properties of the iterations of the 
transformation T are of particular interest. For this purpose we only consider 
invertible transformations and call such dynamical systems invertible. 

The first property, we consider, originates from Poincaré’s famous recurrence 
theorem (see Theorem 1.4 of [32] or Theorem 2.11 of [13]) saying that starting 
from a set A of positive measure j4(A) > 0 and iterating T yields infinitely many 
returns to A. More generally, we call a subset R C N of the positive integers a set 
of recurrence if for all invertible dynamical systems and all measurable sets A of 
positive measure j4(A) > 0 there exists n € FR such that w(A M T"A) > 0. Then 
Poincaré’s recurrence theorem means that N is a set of recurrence. 

A second important theorem for dynamical systems is Birkhoff’s ergodic 
theorem (see Theorem 1.14 of [32] or Theorem 2.30 of [13]). We call T ergodic if 
the only invariant sets under T are sets of measure 0 or of measure 1, i.e. 7'A = A 
implies 4(A) = 0 or w(A) = 1. Then Birkhoff’s ergodic theorem connects average 
in time with average in space, i.e., 


1 N-1 
Jim = Defer") = [ Flx)dye(x) 


n=0 


for all f € L'(X, w) and p-almost all x € X. 

Let us explain an important application of this theorem to number theory. For 
q = 2 positive integer, consider T:[0,1) — [0,1) defined by T(x) = {qx}, 
where {tf} = t — |t| denotes the fractional part of 4. If x € R is given by 
its g-ary digit expansion x = |x| + )°*°, aj(x)q7/, then the digits aj(x) can be 


q 
d € {0,1,...,q—1}. Moreover, since a;(Tx) = aj+1(x) forj > 1 the transformation 
T can be seen as a left shift of the expansion. 

Now we call a real number x simply normal in base gq if 


computed by iterating this transformation T: aj(x) = d if T7~'x € 7 a1) with 


oe ed . 1 
im yi < N:aj(x) =d} = F 


foralld = 0,...,q—1,i.e., all digits d appear asymptotically with equal frequencies 
1/q. A number x is called g-normal if it is simply normal with respect to all bases 


q.¢°.q°,.... This is equivalent to the fact that the sequence ({q"x})nen is uniformly 
distributed modulo 1 (for short: u.d. mod 1), which also means that all blocks 
d,,dz,...,d, of subsequent digits appear in the expansion of x asymptotically with 


the same frequency q~“ (cf. [8, 12, 16]). For completeness, let us give here one 
possible definition of u.d. sequences (x,): a sequence of real numbers x, is called 
u.d. mod 1 if for all continuous functions f : [0,1] > R 


ia l 
fim = of) = i Fla)de. (1) 


n=1 
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Note, that by Weyl’s criterion the class of continuous functions can be replaced 
by trigonometric functions e(hx) = e?""", h € N or by characteristic functions 1;(x) 
of intervals J = [a, b). Applying Birkhoff’s ergodic theorem shows that Lebesgue 
almost all real numbers are g-normal in any base g > 2. Defining a real number to 
be absolutely normal if it is g-normal for all bases g => 2, this immediately yields 
that almost all real numbers are absolutely normal. 

In particular, this shows the existence of absolutely normal numbers. However, 
it is a different story to find constructions of (absolutely) normal numbers. It is a 
well-known difficult open problem to show that important numbers like /2, In 2, e, 
x etc. are simply normal with respect to some given base g => 2. A much easier task 
is to give constructions of g-normal numbers for fixed base g. Champernowne [9] 
proved that 


0.123456789101112... 


is normal to base 10 and later this type of constructions was analyzed in detail. So, 
for instance, for arbitrary base g > 2 


0.(LgQ)J)q (Le) ])q-- 


is q-normal, where g(x) is a non-constant polynomial with real coefficients and 
the g-normal number is constructed by concatenating the q-ary digit expansions 
(Lg(n)])q of the integer parts of the values g(n) forn = 1,2,.... These constructions 
were extended to more general classes of functions g (replacing the polynomials) 
(see [11, 18, 19, 22, 23, 29]) and the concatenation of ([g(p)]), along prime numbers 
instead of the positive integers (see [10, 17, 18, 24]). 

All such constructions depend on the choice of the base number q > 2, and thus 
they are not suitable for constructing absolutely normal numbers. A first attempt 
to construct absolutely normal numbers is due to Sierpinski [30]. However, Turing 
[31] observed that Sierpinski’s “construction” does not yield a computable number, 
thus it is not based on a recursive algorithm. Furthermore, Turing gave an algorithm 
for a construction of an absolutely normal number. This algorithm is very slow and, 
in particular, not polynomially in time. It is very remarkable that Becher et al. [2] 
established a polynomial time algorithm for the construction of absolutely normal 
numbers. However, there remain various questions concerning the analysis of these 
algorithms. The discrepancy of the corresponding sequences is not studied and the 
order of convergence of the expansion is very slow and should be investigated 
in detail. Furthermore, digital expansions with respect to linear recurring base 
sequences seem appropriate to be included in the study of absolute normality from 
a computational point of view. 
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Let us now return to Poincaré’s recurrence theorem which shortly states that the 
set N of positive integers is a recurrence set. In the 1960s various stronger concepts 
were introduced: 


1. R C Nis called a nice recurrence set if for all invertible dynamical systems and 
all measurable sets A of positive measure (A) > 0 and all ¢ > 0, there exist 
infinitely many n € FR such that 


w(ANM TA) > pA) —«. 


2. H C N is called a van der Corput set (for short: vdC set) if the following 
implications holds: 


(Xn+n — Xn)nen is u.d. mod 1 for all h € H = > (%)nen is u.d. mod 1. 


Clearly, any nice recurrence set is a recurrence set. By van der Corput’s difference 
theorem (see [12, 16]) the set = N of positive integers is a vdC set. Kamae 
and Mendés-France [15] proved that any vdC set is a nice recurrence set. Ruzsa 
[25] conjectured that any recurrence set is also a vdC set. An important tool in the 
analysis of recurrence sets is their equivalence with intersective (or difference) sets 
established by Bertrand-Mathis [5]. We call a set Z intersective if for each subset 
E CN of positive (upper) density, there exists n € Z such that n = x — y for some 
x,y € E. Here the upper density of F is defined as usual by 


= #(EN [1 
d(E) = lim sup CEA 
Noo N 


Bourgain [7] gave an example of an intersective set which is not a vdC set, hence 
contradicting the above mentioned conjecture of Ruzsa. 

Furstenberg [14] proved that the values g(n) of a polynomial g € Z[x] with 
g(0) = O form an intersective set and later it was shown by Kamae and Mendés- 
France [15] that this is a vdC set, too. It is also known that for fixed h € Z the set of 
shifted primes {p+h: p prime} is a vdC set if and only if # = +1. [20, Corollary 10]. 
This leads to interesting applications to additive number theory, for instance to new 
proofs and variants of theorems of Sark6zy [26-28]. A general result concerning 
intersective sets related to polynomials along primes is due to Nair [21]. 

In the present paper we want to extend the concept of recurrence sets, nice 
recurrence sets, and vdC sets to subsets of Z*, following the program of Bergelson 
and Lesigne [3] and our earlier paper [4]. In Sect.2 we summarize basic facts 
concerning these concepts, including general relations between them and counter 
examples. Section 3 is devoted to a sufficient condition for establishing the vdC 
property. In the final Sect.4 we collect various examples and give some new 
applications. 
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2 van der Corput Sets 


In this section we provide various equivalent definitions of van der Corput sets in 
Z* In particular, we give four different definitions, which are k-dimensional variants 
of the one-dimensional definitions, whose equivalence is due to Ruzsa [25]. These 
generalizations were established by Bergelson and Lesigne [3]. Then we present a 
set, which is not a vdC set in order to give some insight into the structure of vdC 
sets. Finally, we define the higher-dimensional variant of nice recurrence sets. 


2.1 Characterization via Uniform Distribution 


Similar to above we first define a van der Corput set (vdC set for short) in Z* via 
uniform distribution. 


Definition 2.1 A subset H C Zé \ {0} is a vdC set if any family (Xn) pent Of real 
numbers is u.d. mod | provided that it has the property that for allh € H the family 
(Xn+h — Xn)nené is u.d. mod 1. 


Here the property of u.d. mod 1 for the multi-indexed family (%)yej« is defined via 
a natural extension of 1.1: 


1 1 
lim ——————_. Xn) = / x)dx (2.1) 
N1.N2,...,Ng—> +00 NiN2 +: “Nx senoe, Pe. n) . fC ) 


for all continuous functions f : [0,1] — R. Here in the limit N,,N2,..., Nx are 
tending to infinity independently and < is defined componentwise. 

Using the k-dimensional variant of van der Corput’s inequality we could 
equivalently define a vdC set as follows: 


Definition 2.2 A subset H Cc Z* \ {0} is a van der Corput set if for any family 
(Un) nez Of complex numbers of modulus 1 such that 


1 
VheH, lim ———_. Un+hun = O 
N1,N2,....Ne—>+00 Ny No +++ Ny seneion Ne) —— 


the relation 
1 


lim ———_. 
N1.N2,....Nk—>t+oo Ny No +++ Nx 
O<n<(N1,N2,...,.Ni) 


holds. 
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2.2 Trigonometric Polynomials and Spectral Characterization 


The first two definitions are not very useful for proving or disproving that a set H is a 
vdC set. Similar to the one-dimensional case the following spectral characterization 
involving trigonometric polynomials is a better tool. 


Theorem 2.3 ([3, Proposition 1.18]) A subset H Cc Z* \ {0} is a van der Corput 
set if and only if for all e > 0, there exists a real trigonometric polynomial P on the 
k-torus T* whose spectrum is contained in H and which satisfies P(0) = 1, P > —e. 


The set of polynomials fulfilling the last theorem for a given ¢ forms a convex set. 
Moreover the conditions may be interpreted as some infimum. Therefore we might 
expect some dual problem, which is actually provided by the following theorem. 
For details see Bergelson and Lesigne [3] or Montgomery [20]. 


Theorem 2.4 ([3, Theorem 1.8]) Let Hc Z*\ {0}. Then H is a van der Corput set 
if and only if for any positive measure o on the k-torus T* such that, for allh € H, 
6(h) = 0, this implies o ({(0,0,...,0)}) = 0. 


2.3 Examples 


The structure of vdC sets is better understood by first giving a counter example. The 
following lemma shows to be very useful in the construction of counter examples. 


Lemma 2.5 Let H CN. If there exists q € N such that the set HN qN is finite, 
then the set H is not a vdC set. 


Proof The proof is a combination of the following two observations of Ruzsa [25] 
(see Theorem 2 and Corollary 3 of [20]): 


1. Let m € N. The sets {1,...,m} and {n € N:m } n} are both not vdC sets. 
2. LetH = H,; UH». CN. If H is a vdC set, then H, or H2 also has to be a vdC 
set. 


Suppose there exists a g € N such that HN qN is finite. Then we may split H into 
the sets HM qN and H \ qN. The first one is finite and the second one contains no 
multiples of g. Therefore both are not vdC sets and hence H is not a vdC set. 


The first counter example deals with arithmetic progressions. 
Lemma 2.6 Let a,b €N. If the set {an + b:n € N} is a vdC set, then a | b. 
Proof Let b € N and H = {an + b:n € N} bea vdC set. Then by Lemma 2.5 we 
must have 


an+b=b=0O0modda _ infinitely often. 


This implies that a | b. 


Dynamical Systems 269 


The sufficiency (and also the necessity) of the requirement a | b follows from the 
following result of Kamae and Mendeés-France [15] (cf. Corollary 9 of [20]). 


Lemma 2.7 Let P(z) € Z[z] and suppose that P(z) > +00 as z > +00. Then 
H = {P(n) > 0:n € N} is a vdC set if and only if for every positive integer q the 
congruence P(z) = 0 (mod q) has a root. 


Now we want to establish a similar result for sets of the form {ap + b: p prime}. 
In this case the following result is due to Bergelson and Lesigne [3] which is a 
generalization of the case f(x) = x due to Kamae and Mendés-France [15]. 


Lemma 2.8 ([3, Proposition 1.22]) Let f be a (non zero) polynomial with integer 
coefficients and zero constant term. Then the sets {f(p — 1):p € P} and {f(p + 
1): p € P} are vdC sets in Z. 


We show the converse direction. 


Lemma 2.9 Let a and b be nonzero integers. Then the set {ap + b:p € P} is avdC 
set if and only if |a| = |b], i.e, ap +b = a(p + 1). 


Proof Itis clear from Lemma 2.8 that {ap + b:p € P} is a vdC set if |a| = |b]. 
On the contrary a combination of Lemma 2.5 and Lemma 2.6 yields that a | b. 
Now we consider the sequence modulo b. Then by Lemma 2.5 we get that 


ap + b=ap =O0mod D infinitely often. 


Since (p,b) > 1 only holds for finitely many primes p we must have b | a. 
Combining these two requirements yields |a| = |b]. 


3 A Sufficient Condition 


In this section we want to formulate a general sufficient condition which provides 
us with a tool to show for plenty of different examples that they generate a vdC 
set. This is a generalization of the conditions of Kamae and Mendeés-France [15] 
and Bergelson and Lesigne [3]. Before stating the condition we need an auxiliary 
lemma. 


Lemma 3.1 ([3, Corollary 1.15]) Let d and e be positive integers, and let L be a 
linear transformation from Z4 into Z* (represented by an e x d matrix with integer 
entries). Then the following assertions hold: 


1. If Dis avdC set in Z4 and if 0 ¢ L(D), then L(D) is a vdC set in Z°. 
2. Let D © ZA, If the linear map L is one-to-one, and if L(D) is a vdC set in Z°, 
then D is a vdC set in Z4. 


Our main tool is the following general result. Applications are given in the next 
section. 
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Proposition 3.2 Let g,...,g%:N — Z be arithmetic functions. Suppose that 
8i,,---5 Ri, is a basis of the Q-vector space span(g1,...,g%). For each q € N, we 
introduce 


Dg := {(gi, (n), .++) i, (tin € N and q! | g;(n) for allj = Docismth 


Suppose further that, for every q, there exists a sequence (A) nen in D, such that, 
forallx = (x1,...,Xm) € R”\Q", the sequence (h -X) nen is uniformly distributed 
mod 1. Then 


D := {(gi(n),...,ge(n)):n € N} € Z 


is avdC set. 


Proof We first show that the set 


D:= {(gi,(n),..., gi, (m)):n € N} 
is a vdC set in Z”. For g, N € N we define a family of trigonometric polynomials 


pk 
Pun = ye (ni? -x). 


n=1 


By hypothesis, limy—+o9 Pgw(x) = O for x ¢ Q"”. For fixed q there exists a 
subsequence (P,,y) which converges pointwise to a function g,. Since gg(x) = 1 
(for x € Q” and q sufficiently large) and g,(x) = 0 (for x ¢ Q”), the sequence 
(gq) is pointwise convergent to the indicator function of Q”. For a positive measure 
o on the m-dimensional torus with vanishing Fourier transform 6 on D, we have 
[Pando = 0 for all g,N. Thus o(Q”) = 0 follows from the dominating 
convergence theorem, obviously o({0, 0,...,0}) = 0, and thus D is a vdC set. 

In order to prove that D is a vdC set we apply Lemma 3.1 twice. Since Bits +++ Sim 
is a base of span(gi,..., gx), we can write each g; as a linear combination (with 
rational coefficients) of g;,,..., gi,,. Multiplying with the common denominator of 
the coefficients yields 


Oj8j = Di ABI, +++ + Dj Bin 


forj = 1,...,k and certain a;,b;¢ € Z. Considering the transformation L: Z™ — Z* 
given by the matrix (b;) and applying part (1) of Lemma 3.1 shows that 


{(aigi(n),..., aggx(n)):n € N} 


is a vVdC set for certain integers a1,..., dx. 
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Now consider the transformation L: Z‘ — Zé given by the k x k diagonal matrix 
with entries a,,..., a, in the diagonal. Then by part (2) of Lemma 3.1 also D is a 
vdC set and the proposition is proved. 


4 Various Examples and Applications to Additive Problems 


In this section we consider multidimensional variants of prime powers, entire 
functions and x* log’ x sequences. 


4.1 Prime Powers 


In a recent paper the authors together with Bergelson, Kolesnik, and Son [4] 
consider sets of the form 


{(a1 (Pa + 1)%,...,a%( pp  1)*):n € N}, 


where a;, 8; € R and p, € P runs over all prime numbers. These sets are vdC, 
however, we missed the treatment of a special case in the proof. In particular, if 
for some i # j the exponents satisfy 6; = 6; =: 0, then the vector ( p®, p®) is not 
uniformly distributed mod 1. 

Here we close this gap. 


Theorem 4.1 [f a; are positive integers and B; are positive and non-integers, then 


Di = {((p- 1D)" (p— D™, [(p— DP], (p — DP) |p € Ph, 


and 


Dz = {((p+ D+ (p+ DY. [pt DP] Mp + DP) |p € P} 


are vdC sets in ZE*, 


Proof Since x®! and x” are Q-linear dependent for all x € Z if and only if 6, = 6), 
an application of Proposition 3.2 yields that it suffices to consider the case where 
all exponents are different. However, this follows by the same arguments as in the 
proof of Theorem 4.1 in [4]. 
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4.2 Entire Functions 


In this section we consider entire functions of bounded logarithmic order. We fix a 
transcendental entire function f and denote by S(r) := max,,)<, |f(z)|. Then we call 
A the logarithmic order of f if 


1 
1p WESH) _ 


r>co logr 


The central tool is the following result of Baker [1]. 


Theorem 4.2 ([1, Theorem 2]) Let f be a transcendental entire function of 
logarithmic order 1 < 4 < z. Then the sequence 


(F(Pan>1 


is uniformly distributed mod 1. 
Our second example of a class of vdC sets is the following. 


Theorem 4.3 Let f\,...,f% be entire functions with distinct logarithmic orders | < 
A1,A2, 2... AR < z, respectively. Then the set 


D:= ((LfilPn)]. +++» Lfe(Pn)]):1 € N} 


is a vdC set. 


Proof We enumerate D = (d,)n>1, where 
dn = (Lfi(Pa)],---+ Lfe(Pn)])- 
First we show that for every g € N the set 
D® := {(d,,..., dx) € Dig | dj} 


has positive relative density in D. We note that if 0 < {ites < 7 forl <i<k, 


then d, € D®. By Theorem 4.2 the sequence 


(4 poy) 
q aes q n>1 


is uniformly distributed and thus D™ has positive density in D. 


For each q € N we enumerate the elements of D) = (a) 1, such that lay” | 


is increasing. Since the logarithmic orders are distinct we immediately get that the 
functions f; are Q-linearly independent. Thus by Proposition 3.2 it is sufficient to 
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show that for all g € N and all x = (x1,...,x%) € R‘ ‘ Q* the sequence ca?” *X)n>1 
is u.d. mod 1. 

Using the orthogonality relations for additive characters we get for any nonzero 
integer h that 


1 
[{n < N:d, € D@}| & a h(dy? - x)) 


1 ji Jk 
- pare, > woe (% (m+ + eg))): 


ik=1 ener 
The innermost sum is of the form 
Y5 e(g(pn)). 
n<N 


with g(x) = 4 a | fi(x)| for a certain (a1,...,a%) € RF \ O@ 

By relabeling the terms we may suppose that there exists an € such that 
a,...,a@¢ ¢ Q and a4j,...,a@% € Q. Furthermore we may write aj = < for 
£+1<j<m.Then 


k £ k 
e((Pn)) = (d a) =| [ sioifi(pn).filpn) [] oie). 


i=1 j=l j=l 


where s;(x, y) = e(x— fy}a)) (I <j < O and4(z) =e (ae ) C+ 1 <j <0. 
Since s;(x,y) is Riemann-integrable on T? for j = 1,...,€ and 4(z) is 
continuous on Z, = Z/qZ, the function ic Sj ie 414 is Riemann-integrable 
onl x Zi, 
Now an application of Theorem 4.2 yields that for any u € N the sequence 


fest Po aes at 
n> | 


u 


(crhi(rn) firms SDS. 


atl 


Xx 
7 ], we 


is u.d. in T?’ x T*. Since |x| = a (mod q) is equivalent to = [S, 
deduce that 


(afi (Pn). fi(Pn), ++», Cefe( Pn). fe(Pn)» Lftti(Pa)l. ++. LAP) Dna 
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is u.d. in T? x Yim , and Weyl’s criterion implies that 


k 
an < de (>: o%; i) =i), 


n<N i=1 


proving the theorem. 


4.3 Functions of the Form x” log? x 


In the one-dimensional case Boshernitzan et al. [6] showed, among other result, that 
these sets are vdC sets. Our aim is to show an extended version for the k-dimensional 
case. Therefore we use the following general criterion, which is a combination of 
Fejer’s theorem and van der Corput’s difference theorem. 


Theorem 4.4 ({16, Theorem 3.5]) Let f(x) be a function defined for x > 1 that is 
k-times differentiable for x > xo. If f(x) tends monotonically to 0 as x > 00 and 
if limy+ooX |F® (x)| = 00, then the sequence (f(n))n>1 is u.d. mod 1. 


Applying this theorem we get the following 
Corollary 4.5 Leta # 0 and 


— either o > 0 not an integer and t € R arbitrary 
— oro > Oan integer and t € R \ (0, 1]. 


Then the sequence (an® log‘ n)n>2 is u.d. mod 1. 
Our third example is the following class of vdC sets. 


Theorem 4.6 Let a,...,a, > O and B,,...,Be € R, such that B; ¢ [0,1] 
whenever a; € Z fori = 1,...,k. Then the set 


D := {(|n™ log?! nj,..., |n® log®* n|):n € N} 


is a vdC set. 


Proof Following the same arguments as is the proof of Theorem 4.3 and replacing 
the uniform distribution result for entire functions (Theorem 4.3) by the correspond- 
ing result for n® log’ n sequences (Corollary 4.5) yields the proof. 
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Hypothesis. Certain integrals over Dirichlet series appearing in this approach can be 
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1 Introduction 


Properties of the Estermann zeta function have been used by Balasubramanian 
et al. [2] to prove asymptotic formulas for mean-values of the product consisting of 
the Riemann zeta function and a Dirichlet polynomial. These asymptotic formulas 
contain cotangent sums which also appear in recent work of Bettin and Conrey 
[4] on period functions. Very recently, Maier and Rassias in their paper [14] prove 
asymptotic results and upper bounds for the moments of cotangent sums under 
consideration. Their main result is the existence of a unique positive measure ju 
on R with respect to which these cotangent sums are equidistributed. 


2 The Cotangent Sum and Its Applications 


In the present paper, we consider the following cotangent sum: 


Definition 2.1 [fr,b ¢ N, b> 2,1 <r< band (r,b) = 1, we define 


The function co(r/b) is odd and has period 1. Its value is an algebraic number. We 
first exhibit some relations of the cotangent sums to the Estermann and Riemann 
zeta functions and connections to the Riemann Hypothesis. 


Definition 2.2. The Estermann zeta function E (s _ a) is defined by the Dirichlet 


>D 
series 


a(n) exp (2 inr/b) 
E(s 2a) =>" n) exp (27rinr 


ns 
n>1 


where Res > Rea+1,b> 1, (r,b) = 1 and 
nni= > e. 
d\n 


It can be proved that the Estermann zeta function can be continued analytically to a 
meromorphic function, on the whole complex plane up to two simple poles s = 1 
ands = 1+aifa@ #0 ora double pole at s = 1 if a = 0 (see [9, 11, 17]). 
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The Estermann zeta function satisfies the functional equation: 


1+a—2s 
E(s,.a) = -(=) rii—-s)rd+a-s) 


Qn 


(“)e iia T ( we VE 1+ T 
x = —5,5,a)— ==> =Sy= 5s : 
cos ,) a ‘ee cos ( 7s 5) a-—s 5° 


where 7 is such that 7r = 1 (mod b) and I(s) stands for the Gamma function. 
Balasubramanian et al. [2] used properties of E (0, -— 0) to prove an asymptotic 


formula for 
r= f(s eae A a 
= : 5 i 5 i 


where A(s) is a Dirichlet polynomial. 

Asymptotics for functions of the form of J have been used for theorems which 
give a lower bound for the portion of zeros of the Riemann zeta-function ¢(s) on the 
critical line (see [12, 13]). 

A nice result concerning the value of E (s, zi a) at s = O was presented by 
Ishibashi in [10]. 


Theorem 2.3 (Ishibashi) Let b > 2,1 <r<b, (r,b) =1,a€NU {0}. Then 
(1) If a is even, it holds 


2 2 


dt , 


where 5,9 is the Kronecker delta function. 
(2) If a is odd, it holds 


Ba 
EO? ea. 
b 2(a + 1) 
Ifr = b=1, one has 


(—1)°*' Bos 


ea en 


where B, denotes the Bernoulli number (see Definition 2.4). 


Hence for b > 2,1 <r<b, (r,b) = 1, we have 


cofa)=t+ia(f). 


where co(r/b) is the cotangent sum (see Definition 2.1). 


280 H. Maier and M.Th. Rassias 


The above result gives a relation between the cotangent sum co(r/b) and the 
Estermann zeta function. 

The cotangent sum co(r/b) can be associated with the so-called Vasyunin sum, 
which is defined as follows: 


where {u} = u—|u|,ueER. 
One can prove that (see [3, 4]) 


where, as mentioned previously, 7 is such that 7r = 1 (mod b). 
The Vasyunin sum [19] is itself associated with the study of the Riemann 
Hypothesis through the following identity (see [3, 4]): 


1 pec ie a] 2 rit dt log2x-y (1 1 
— Srl (2) —— = = sae a4 
ea ae i (5+#) ‘oe Ce) co 
b-r r W r b 

——— lee -= = Vi = sl 

yb ( () x v(2)) 

The only non-explicit function in the right-hand side of (2.1) is the Vasyunin sum. 
Formula (2.1) is related to the Nyman—Beurling—Baéz-Duarte—Vasyunin 


approach to the Riemann Hypothesis (see [1, 3]). The Riemann Hypothesis is 
true if and only if 


lim dy = 0, 
N. 


where 


2 dt 


a 2 
z+ 


dx as ~ c(=+ it) Lit 
= inf — —C(-+i —+i 
N Dy 20 Joo 2 MND 


and the infimum is taken over all Dirichlet polynomials 


From the above considerations we see that the behavior of co(r/b) helps to 
understand the behavior of V(r/b). From (2.1) we may hope to obtain some 
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information related to the Nyman—Beurling—Baéz-Duarte—Vasyunin approach to the 
Riemann Hypothesis. 


Definition 2.4 The m-th Bernoulli number B,, is defined by 


(2m)! 9 
Bay = 2 v ve Bom =0, 
2 (22)?™ De am+1 


v>1 


where m € N. Furthermore, we have B_; = —1/2. 


3 Main Result 


We now discuss the equidistribution of certain normalized cotangent sums with 
respect to a positive measure, which is also constructed in the following theorem. 


Definition 3.1 For z € R, let 
F(z) = meas{a € [0,1] : g(a) < z}, 
where “meas” denotes the Lebesgue measure, 


+00 
e(a) = 


I=1 
and 
Co(R)={f € C(R) : Ve > 0,45 a compact set K CR, such that | f(x)| < €, Vx ZK}. 
Remark The convergence of this series has been investigated by de la Bretéche and 


Tenenbaum (see [7]). It depends on the partial fraction expansion of the number a. 


We now state Theorem 3.2, the main result of our paper. An overview of the basic 
steps of its proof is provided in Sect.4. The only fact, whose proof is provided in 
greater detail, is stated in Lemma 4.8. It implies the continuity of the distribution 
function of the cotangent sums co. 


Theorem 3.2 


(i) F is a continuous function of z. 
(ii) Let Ag, A, be fixed constants, such that 1/2 < Ay < A, < 1. Let also 


7 1 g(x) 2k 
mmf (BP) a 


where Hj, is a positive constant depending only on k, k € N. 


282 H. Maier and M.Th. Rassias 


There is a unique positive measure jt on R with the following properties: 


(a) Fora < B € Rwe have 
(la, B]) = (Ai — Ao) (F(B) — F(@)). 
(b) 


[eu = ‘ —Ao)Akj2, for evenk 


; otherwise . 


(c) Forallf € Co(R), we have 


1 1 r 
lim — =co(—)) = | fdu. 
b>+00 b(b) = 1Ge@)) fr a 
Aob<r<A\b 


where (-) denotes the Euler phi-function. 


Remark Bruggeman (see [5, 6]) and Vardi (see [18]) have studied the equidistri- 
bution of Dedekind sums. In contrast with the work in this paper, they consider an 
additional averaging over the denominator. 


4 Outline of the Proof and Further Results 


Rassias [15, 16] proved the following asymptotic formula: 


Theorem 4.1 For b > 2,b €N, we have 


1 1 b 
. (;) = — blogh — (log 2x — y) + O(1). 
b a wv 


In that paper, the fractional parts are expressed in terms of cotangent sums. This 
method is generalized in the present paper, where some stronger results are being 
proved. 

In [16] also the following improvement of Theorem 4.1 is proved. It is not needed 
in the proof of Theorem 3.2. 


Theorem 4.2 Let b,n € N, b > 6N, with N = |n/2| + 1.There exist absolute real 
constants A,,A2 > 1 and absolute real constants FE), 1 € N with |E)| < (Ai), such 
that for each n € N we have 


1 1 b i < 
co (=) = —blogb — —(log2x — y) — — + y E\b' + R*(b) 
Tu u ei 
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where 
[Ri (b)| < (Aon) DOF”, 
In the following Proposition 4.3, due to the second author (see [16]), a relation is 


obtained between the cotangent sum co and the modified sum Q. Thus the study of 
Co is reduced to the study of Q, which is crucial. In [16] also Theorem 4.4 is proved. 


Proposition 4.3 For r, b € N with (r, b) = 1, it holds 


o(@)=t9({)-406) 


where 


0() = Yoo) [2 I. 


Theorem 4.4 Let r,bo € N be fixed, with (bo,r) = 1. Let b be a positive integer 
such that b = bo (modr). Then, there exists a constant C; = C,(r,bo), with 
Ci (1, bo) = 0, satisfying 


1 b 
& (-) = —blogh — —(log2x — y) + C; b + O(1), 
b mr qr 


for large integer values of b. 


We now list other results proven in [16]. In the sequel we shall give a few hints 
concerning their proofs. 


Theorem 4.5 Let k € N be fixed. Let also Ao, A, be fixed constants such that 1/2 < 
Ag < Ay < 1. Then there exist explicit constants Ex > 0 and Hy > 0, depending 
only on k, such that 


(a) 


r\ 2k 
D O(5) = Be AN Ai) o*$ GA + 0(1), > +00). 
ri(r,b)= 
Pecan 


(b) 


2k-1 
Y 2(F) = *90), > +00). 
oe. 
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(c) 


Sco (Z)" = He Ai -Avyo*HOH)A + 01), ( + +00), 


ri(r,b)=1 
Aopb<r<A,b 


(d) 


2k—1 
Py co(=) = 0 (b*'$(b)), (b> +00). 
Pct 


Applying the method of moments, we deduce detailed information about the 
distribution of the values of co(r/b), where Agb < r < A,b and b > +00. In 
fact, we prove Theorem 3.2. 

Finally, we study the convergence of the series 


SH 
k>0 

and prove the following theorem: 

Theorem 4.6 The series 
Hs 
k>0 

converges only for x = 0. 

Another interesting question is whether the series 


>» Ak 42k 
(2k)! ” 


k>0 


has a positive radius of convergence. This would lead to a simplification in the 
proof of our equidistribution result, since in this case we could apply the theory 
of distributions which are determined by their moments. We now give a few hints 
concerning the proofs of Theorems 4.4 and 4.5. 

By Proposition 4.3 we know that 


o(G)=!(2) 200). 


where 
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We partition the range of the above summation into intervals on which the term 
|rm/b| assumes a constant value /: 


o(t)=F XY cot(™) 


Oss F leer 


and define 
S; = {rm : bj < rm < bj + 1),me Z}. 
We write 
S; = {bj+ 5, bi +sj)+r,...,b) +5) + 4)r}, 


where d; € {0, 1} and introduce s = s; as a new variable of summation. 
The relation between j and s; is given by 


sj = —bj(modr) 


and thus for a given value s; = s we can find j by 
jf sbh* 
ae rj? 


l(modr), 1 < b* <r—1. 


where b* is defined by 


bb* 


Since cot(zx) has a pole of first order at x = 0, the sum Q(r/b) is dominated by 
small values of s. The substitution 


a=a(r,b) = — 
r 
and the asymptotics 


1 
cot(zx) ~ — 
UX 


lead to the approximation of Q(r/b) by 


b 
—" 9(a). 
IU 
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For the proof of Theorem 1.5 of [14], the crucial property is the continuity of the 
function F(z), where 


F(z) = meas{a € [0,1] : g(a) < z}. 


We shall give a proof of this fact shortly. The proof of Theorem 4.6 is obtained by 
studying the contribution of the interval 


I(k) = [ee] 


to the moment 


_ fi ga)" 
m= f (2) dx. 


Definition 4.7 A distribution function G is a monotonically increasing function 
G:R = (0, 1]. 


The characteristic function of G is defined by the following Stieltjes integral: 


+00 
v(t) = / el dG(u). 


co 


(cf. [8, p.27]) 


Lemma 4.8 The distribution function G is continuous if and only if the character- 
istic function satisfies 


1 T 
lim inf — t)|\’dt = 0. 
Toto 27 iz vO 
Proof See [8, p. 48, Lemma 1.23]. 
Definition 4.9 Lett > 1. We set 


Kak = [OL catg =|" Beko HP” | 


and 


oa, K) = aS zt) ies 237 nt) 


I<K I>K 


where B*(u) = u—|u|—1/2,ueER. 
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Assume that (a;) with 0 = a < a) < +--+ < dg = 1 isa partition of |0, 1] with 
the following properties: 


1 
ae < O41 —@; < 2R! 


and g(a, K) is continuous ata = a; for0 <i<R. 


We now make preparations for an application of Lemma 4.8 with G = F, and 


1 
W(t) = Ot) := ; e (2) da. 


Lemma 4.10 The function h(a) has a Fourier expansion 


h(a) = > c(n) sin(27na), 
n>K 
with 


Renee 
vie 


where t stands for the divisor function. 


Proof From the Fourier expansion 


n=—Co n 

n#0 
we obtain 

i i e(Ima) 
a) = 2 YY = > acme 
I>K gor |nl>K 
with 
d(n) = ——|{(I,m) : Im=n, 1 > K}}. 
Nn 

We have 


h(a) = S~ d(n) (e(na) — e(—ne)) = 21) d(n) sin(2xna), 


n>K n>K 


which completes the proof of the lemma. 
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Definition 4.11 We set 


h(a) := > c(n) sin(277na) 


K<n<L 


and 


h(a) = y c(n) sin(27n@). 


n>L 


Lemma 4.12 We have 


1 
i (« (5 (ele. K) + m(@))) ~¢ (2) da = 0(r-1™)., 


Proof By Parseval’s identity, it follows that for every € > 0 it holds 


1 
i: hy(ay’'da = SY > e(ny KL, 
0 


n>L 


because of the estimate 
c(n) <n! T€, 
Thus, for all w € [0, 1] not belonging to an exceptional set € with 
meas(€) = O (ee) F 
we have 


ho(a) = O (SD) 


c (=°) = ] _ 0 (1/1) 
20 


by the Taylor expansion of the exponential function. 


Hence, 
' (tg(@) Lee 
| e( az ) aw f e (5—(ea.K) + n(@))) da 


=f (eee “ee EHO) ||. (B@)- 1 
~ Jo 


and therefore 


da 
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th 
< [2a+ f e( SO) A a 
E [0,1]\E 20 


=O (te) : 
Lemma 4.13 There exists a setI C {1,...,R} of non-negative integers, such that 
Sai — aj) = O('/1) 
ie! 


and for i £ I, o& € [c;, 04] we have 
|hi (a) — hi(ai)| < po UF 1/100) 


Proof We have 


d 
Fr) = 2mnc(n) cos(27na@) 
om K<n<L 
and 
nig 
Fg) =— Ss 4x7n*c(n) sin(27n@). 


K<n<L 


By Parseval’s identity, for every € > 0 we get 


1 
I 
and by the Cauchy—Schwarz inequality, it follows that 

1 
i 


We now define the set J as the set of all subscripts i for which the closed interval 
[@;, @;41] contains an w with 


2 


 n@ da = O(L'**) 


da 


meas 


= O(LV?+) , 4.1 
aa da = O( ) (4.1) 


[hy (a) — hy (oe) > T1109), 


Since 


ie) =e) i, : Gh(Bdp (4.2) 
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and 
la—a;|=O(r*), 
it follows that for i € / there must exist 6 € (a@;,@;41) with 


= pis. 


d 
pra 


eee : : 2 : 
Because of the estimation of the Fourier coefficients of hy (a), we obtain 


a 2+ 
Analogously to (4.2) we obtain that 
d 1 
— hf > _p/s 
a 1(@)} > 7? 


for every a € [a;, @+1] and therefore 


i 
Gj 


From (4.1) we obtain that the measure of the union of the closed intervals [a;, a4] 
with i € Jis O(t—'/!), which concludes the proof of the lemma. 


= he) 


1 
7 da > xf Pain — aij). 


Lemma 4.14 We have 
lim @(t)= lim @(t) = 0. 
t>+00 t—->>—0o 
Proof We shall prove the result only for t + ++00, since the proof of the part when 


t —> —oo is analogous. 
By Lemma 4.12, we have 


1 1 
= [ (42) da = i e(=-(¢(@. K) + in(a@))) + 0(-™) 
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and thus 
1 R 7 Aji+1 
P(t) = [ e (=) da = bac (AS) a , 7 (4) . 
‘a 
+1 (te(a, K) th, (a) th (ai) 

EES) ((S2)--(42)) 
an 

+O (Des - «| +0 (om) ; 

i€l 


From Lemma 4.13 we get 


R Qi+1 
(1)= fe (82) aa =Jre (AS) f st (22%) da+0 (1-1/1), 


i=0 
i¢I 
(4.3) 


We now estimate 


[~ ES da 
” . Qn : 


for i ¢ I. Let J; — 1 be the number of discontinuities of the function g(a, K) in the 
interval [a;, @41]. Let Bio = oj, Biz, = oi41 and let the discontinuities of g(a, K) 
in [a;, @;+1] occur at the points Bj) < Bi2 <--- < Bigj-1 

In the intervals [6;, , Bj,.+1] the function g(q@, K) is a linear function, that is 


g(a, K) = d,—2Ka, 


where d,. € R. Therefore, 


i oe 


[ere tg(a, K) ae 
ir 20 


22 


Jj 


=O UK"): (4.4) 
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From (4.3) and (4.4), we get 


' (1g(a) | (2+! (t9(a,K) 
d < FOr se d O pi /too 
fear esd e( = ) aa] + ( ) 
idl 
R 
=O} 0K)" S04) + 0(0"™). 
Z 


The number of discontinuities of g(a, K) is O(K*), since each of the K terms 


B* (la) 
l 


has O(K) discontinuities in the interval [0, 1]. We thus have 


R 
So J = O(K’). 
i=0 

Then 

b() = O(F 1). 

Therefore 

lim @&(t) = 0. 
t++0o0 


Similarly, we obtain 
lim @(t) = 0, 
t>—0o 
which completes the proof of the lemma. 


Lemma 4.15 F is a continuous function of z. 


Proof This follows from Lemmas 4.8 and 4.14. 
Thus, part (1) of Theorem 3.2 is now proved. 


Acknowledgements We express our thanks to the referee for providing valuable remarks. 
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A Turan-Kubilius Inequality on Mappings 
of a Finite Set 


Eugenijus Manstavicius 


Zur Erinnerung an Professor Wolfgang Schwarz 


Abstract Similarly as in number theory one may define the notion of an additive 
function in the set of all mappings of a finite set into itself. If a mapping is sampled 
uniformly at random, the function becomes a sum of dependent random variables. 
Estimation of its variance via the sum of variances of the summands is a non- 
trivial problem. We give an answer analogously to the Turan-Kubilius inequality, 
well known in probabilistic number theory. 


Keywords Additive function * Assembly * Random mapping * Second 
moment ¢ Turdn-Kubilius inequality 


2010 Mathematics subject classification: Primary 60C05; Secondary 05A15, 
11N37 


1 Introduction and Results 


The present note concerns random mapping statistics which share many analogies 
with probabilistic number theory, one of the favourite research fields of Professor 
Wolfgang Schwarz. During frequent visits to the Goethe University, the author felt 
the host’s interest and encouragement to study a theory of functions defined on 
decomposable combinatorial structures. It was not a coincidence, that our paper 
[9] dedicated to his 70th birthday discussed an analytic approach to explore mean 
values of multiplicative functions defined on mappings of a finite set into itself, and 
consequently, to prove limit theorems on value distribution of additive ones. Here 
we are concerned with the same object but the main target is to prove an analogue of 
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the Turan-Kubilius inequality (see [7], Lemma 3.1) for the second central moment. 
A historical survey of the results for these mappings can be found in [9] and the 
references therein. 

Let us introduce some notation. By Ty we denote the set of all mappings g from 
the set {1,...,N} into itself. A mapping g can also be understood as a labelled 
directed graph Gy, on N vertices which we call a functional digraph. In Gg, an edge 
from i to j exists if and only if y(i) = j. Many properties of gy can be expressed 
via the component vector k(y) = (ki(g),..., ky(@)), where kj(@) is the number of 
connected components of order (number of vertices) j in Gy. By definition, 


Iki(g) +--+ + Nky(g) =N. (1.1) 


Let vy(...) = N “fg € Ty: ...}| be the uniform probability measure on Ty. 
Observe that relation (1.1) makes the variables kj(y), 1 < j < N, dependent with 
respect to vy. Given a real two-dimensional array {hj(k)}, where j > 1 and k > 0, 
such that h;(0) = 0, we define an additive function h(¢) in Ty by setting 


N 
h(g) = D> Ail). (1.2) 

j=l 
If hj(s) = sa; with some a; for each j,s > 1, then the function h(¢) is called 


completely additive, and equals just a linear combination of kj(~). The number of 
components w(¢) := ki(g)+--:+ky(@) is the simplest example of such a function. 
Changing the sum in (1.2) to the product, in a similar manner, we obtain the notion 
of a multiplicative function in Ty. 

Denote by m; the number of connected functional digraphs of order j > 1. It is 
known [1, 4] that 


1 
2j 


8 
Es (1.3) 


jolly 
mj >= (Gi DIDS arte! a 
s=0 
if j = 1. The last inequality was proved by J.C. Hansen [5]. Because of (1.3), the 
class of mappings of all orders Y := UysoTy, where To = {9}, is logarithmic 
(see [1]); therefore, many general results obtained for this class (see [1]) or its 
generalization (see [11]) are at our disposal. Let r, 1 < r < N, be a parameter and 
write k,(y) := (ki(¢), ...,k,()) for a shortened component vector. If &,1<j<r, 
are independent Poisson random variables (r.vs) defined on some probability space 
(2, #,P), EE = A; and é. := (&,...,&,), then we have [1, 11] the following total 
variation distance estimate 


x 


SrENH 


rn(li(g) = 51) — PE. = 5)| « (Z)\ (4) 
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where c > 0 is an absolute constant. Here and afterwards < stands for the symbol 
O(-) with an absolute constant in it and X¥ = Y means that X < Y and Y <« X. So, 
the value distribution problems concerning the truncated additive function defined 
only via k,(g) if r = o(N) as N -> oo, can be easily transferred to probability 
theory. To estimate the influence of the remaining part of an additive function 
defined via kj(g~), r < j < N, one can use the author’s inequality (see [8], Lemma 6) 
for the tail probability 


So hi(&) — a 


JEN 


vy (lh(g) — al =u) « pi 


1 
=u/3) +, aceR, uve RR. 
7 JN 


Moment estimates would be very useful at this stage. Unfortunately, we do not have 
them so far. In contrast, moments of an additive function defined on permutations 
have been intensively studied (see [10, 12] and the references therein). Partly filling 
up the gap, in the present note we will analyse the variance 


Vvh(y) = N-N S~ (R(y) — Enh)’, 


geTy 


where Ey denote the expectation with respect to vy. 
It is worth starting with a few known identities which can be deduced using the 
exponential generating series 


ws = J 
Z(z) a 1+ ye oP} y th 


=1 


u. 


corresponding to the class .7. Firstly, observe that 


Yom ye (2)'4 


L(s)=NjSN 


Secondly, 


w(K) =3) = TT (F)"5 (1.5) 


! ! 
pear sj! 


for arbitrary vector 5 € Nj satisfying £(5) = N, where 


NN eN 1 
QIN) = 5 = —(1 es o(5)) (1.6) 
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by Stirling’s formula. This gives the rough estimate 

JO(N — j = \-1/2 

e/2( een) ~ (1 ui ) Peyawn, (1.7) 
Q(N) 


which we will frequently apply in the sequel. 

Formula (1.5) defines the distribution of the component vector supported in the 
region £~'(N). Its applications provide formulae for the factorial moments of kj(¢). 
Set Q(0) = 1, (x)o = 1, and (x), = x(x- 1)---(®@-—r+ 1) ifr Ee N. We have [1] 


N — jr) pmj\r 
Ey (kj(¢)), = a e, (1.8) 


provided that jr < N and Ey(kj(9)),. = 0 otherwise. Moreover, 


Ev (k(¢)), (ki(e)), = om (ey (=), (1.9) 


provided that i # j and jr + il < N and Ey (ki(g)) (ki(g)), = Oifi # j and 
jr +il > N. By (1.6), 


Ey(K(y)),~ A}, Ev(K()), (ki(@)), ~ FAI 


provided that i, 7, and / are fixed natural numbers and N — oo. This, in addition 
to (1.4), shows asymptotic closeness of k;(@) to the Poisson r.v. §. Finally, we recall 
that the stochastic behaviour of various statistics defined on random mappings is 
similar to that defined on random permutations sampled according to the Ewens 
probability with parameter 0 = 1/2. Therefore the approach applied in paper [12], 
exploring in particular this case of permutations, plays an important role. Here, we 
must also acknowledge the influence of the number-theoretical paper [2]. 

For the sake of simplicity, our first theorem is stated only for a completely 
additive function defined via hj(s) = sa; with arbitrary a; € R, where j < n and 
s > 0. Define 


B°(N) := )> +—_~— 


JSN 


ave! Q(N — j)Aj 


-1/2 
aT “y 2 “(1 I 


jen J 


The last estimate follows from (1.3) and (1.7). To motivate the use of B(N) we 
include the following estimate. 
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Proposition If 


aze!Q(N — j)Aj 
O(N) 


’ 


R(N) := > 


JSN 


Vw (ajkj(y)) — 


then R(N) « B?(N)/JN. 


Theorem 1.1 For an arbitrary completely additive function and N = 1, we have 
Vwh(c) < B°(N). (1.10) 


In the general case, it is natural to use instead of B?(N) the quantity 


h?(k)A* ekO(N — jk) h?(k) jk \-122 
PO) = ony <a ge owa) - 


jk<N JKSN 


We introduce the Mth coefficient of the product of two power series by 


QO” (M) := [z"]Z(z) exp { — me 7 ye IT aa 


£(s)=M i<M 
950 


provided 1 <j < M < N and Q\)(M) := Q(M) if j > M. 


Theorem 1.2 For an arbitrary real additive function given in (1.2) and all N = 1, 
we have 


h(k)Ak ek OW (N — j 
E,h(y) = )~ a ONS (1.11) 


Pat k! Q(N) 
and 


Vyh(~) « D?(N). (1.12) 


The best constant in inequality (1.10), or in (1.12), is the maximal eigenvalue 
of the involved symmetric matrix. We guess that its numerical estimate is 5/3 + 
o(1) as N > ov. Following P.D.T.A. Elliott [3], one can find a corresponding dual 
inequality for either (1.10) or (1.12). Adopting the well-known Rényi’s approach 
proposed in number theory, one can present a simpler proof of Theorem 2 in [9] by 
establishing an analogue of Delange’s theorem for multiplicative number theoretical 
functions. The list of possible applications of our inequalities could be continued 
further. Their proofs are presented in the next section. 
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2 Proofs 


Proof of the Proposition We deduce from (1.8) and (1.9) that 


e/Q(N — j)Aj 


. I<; 2.1 
ON) J< (2.1) 


Eykj(g) = 


Eyk;(¢) = Eykj(y) + 1{j < N/2}En(kj(9)), 
elQ(N — j)Aj eIQ(N — 2A; 


= NS as; 23} —____ 2.2 
OW) + Uj < N/2} OW) (2.2) 


and 


OWN -DAy _ ay 5 eFQ(N — 2j)Az 
ow ~ 4 SN/2}—oay 
cD (N — ja; 


Q?(N) 


Vuvki(g) — 


Hence 


e/Q(N — 2j)  e/Q?(N —j) 
O(N) Q@(N) |! 


RIN) = >) a@ 


JSN/2 


,e7/Q? (N —j) 2 
FH 
“7 24 Ow “ 


By virtue of (1.7), the last sum in R(N) is majorised by 


a> g A Q(N — ee I (, a=) 


wien FON) N+1 
a? JQ(N —j 
ei 4 POW) D x N7'/2B2(N), 
N/2<j<N J Q(N) 


The same argument applies and the estimate holds for the partial sum in R(N) over 
N/4 <j < N/2. Finally, combining (1 —x)~!/? = 1+ x/2 + O(2?) where 0 < x < 
1/2 with the asymptotical formula (1.6), we obtain 


3 qjAi |e QW -2/) _ ON -)) 
J Q(N) Q?(N) 


jsN/4 
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< NT! pa a;hj < N7'B?(N). 
i<N/4 


Collecting the estimates for the partial sums we furnish the proof of the proposition. 


Proof of Theorem 1.1 Seeking upper estimates we may deal with a; > 0 andj < N 
only, and later combine the results for positive and negative parts of h(o) with the 
inequality (a + b)* < 2a? + 2b’, where a, b € R. Let us calculate the moments of 
h() defined via nonnegative a;. We deduce from (1.9), (2.1) and (2.2) that 


yw, elO(N-j) 
Exner 2 ah ON) 
and 
HiQ(N —i-j) 
Eyh2(y) = B2(N) + sn 
dX met Q(N) 
Hence 


Vwh(~) = Eyh?(g) — (Ewh(g))” 


i+j a4 =f7 
= B’(N) = Pa Pen JQ(N iQ(N J 


a ON) 
i+j>N 
af ON =i-) .QN= | 
iajAjAjeiti — 2.3 
* 2, aaneve (err, PUN) — 


We now omit the non-positive terms on the right-hand side of (2.3) to obtain 
Vvh() < BN) 


ae ow =F _ QW How py" 
# 2 sane ( O(N) 0?(N) . 
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Here x* denotes the positive part of x € R. The inequality ab < (a? + b*)/2 for 
a,b € Rx implies the further relation 


Vyh(g) < B’(N) 


. Q(IN-i-j) QW-)OW-j\* 
pa pre! ae Om O(N) ) 


J<N ix<N-j 
We now see that an inequality 


i =F 
A(M) := ( mee De )<(omom— 0 - ow - noun) 


i<M/2 M/2<i<M 


=: A\(M) + A2(M) « Q(N)Q(M), (2.4) 


where M := N —j > 1, suffices to complete the proof. By virtue of (1.7), we have 


O(N)O(M) Ps i y” 


oe M/2<i<M ae 


A,(M) « «K Q(N)Q(M). 


To estimate the sum A,(M), we twice apply asymptotical formula (1.6). This and 
the inequality (1 — x)~!/2 — 1 < 2x for 0 < x < 1/2 yield 


A\(M) = Q(N)Q(M) >> (1 7 iy a o()) 


i<M/2 


0-9) "(4 oG)T 


= mam) > -0(+) < amo. 


i<M/2 


Adding the estimates of A;(m) and A2(m), we obtain (2.4). Theorem 1.1 is proved. 


Proof of Theorem 1.2 Observe that we may take hj(s) = 0 for js > N, since these 
values never appear in the moments of an additive function h(gy). Moreover, it 
suffices to deal with nonnegative functions h(g) only. Formula (1.5) yields 


1 AfeltQO Ay — jy 


vn (ki(~) = - om ” 2G at si! = omy 


Oi Pal i<N 
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Hence 


= hy(k)A¥ elk QW (N — jk) 


Eyh(y) = )> hk) oy (ki(g) = &) = i O(N) 


jk<N JKSN 


as claimed in (1.11). 
Further, define 


evan © IG) ss 


3)=M r<M 
sp=sj=0 


provided that 1 < i <j < M < N, O“)(M) := OY(M) fori < M <j < N, and 
Q") (M) := Q(M) if M <i <j. If il+ jk < N andi ¥ j, then 
Aire" HOG) (N — il — jk) 


vy (ki(y) = L(g) = k) = 'k!O(N) 


and 


Eyl? (y) = D> hi(Dhj(k) vy (ki(Y) = 1(Y) = k) 


ij<N 


se ek QW (N — jk) 
- pea 


yen O(N) 
3 hi(Dhy (KA At ell tik QD (N — il — jk) 
Pomee Lk! O(N) 
iFj 


Omitting a part of the nonnegative summands we have 


x Abhj(D) e QO (N = il) AFhj(k) O(N — jk) 


2 
(Eva()) > i! O(N) kl O(N) 


jk+il<N 


The last two relations imply 


Maj) jh) 
l! k! 


Vwh(y) <D?(N)+ >> 
jktil<N 
seoiltik ( QIN = i= Jk) QIN — J OOW - id 7 
O(N) a(n? 
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Mh? (k)el* 
< D?(N) A cies Duieiees 
pie: 
x ee et 710 ey 
w<n—je | Q(N) aw)’ 


In the last step, we again applied the inequality ab < (a* + b?)/2. It remains to 
estimate the inner sum. Namely, we have to prove that 


Aj (QO (M il) QP (M) e'QOW a 
S(M):= >. 4( = —_a 1 2.5 
os Ln al O(M) QM) QIN) i. = 


for 1 <M := N—jk < N andj < N—1. The sum over M/2 < il < M does not 
exceed 


1 
OM) ae i x= 0M =i9 


M/2<il<M 
1 Ae" lei 
~ Dep 2 2 Le ost 
M/2<il<M I = M—il r<M— 


qj 


The estimation of the remaining sum in (2.5) over iJ < M/2 is more subtle. We 
need asymptotic formulae for the quantities in the difference 


QO) (M — il) — QY(M) e"QO(N — il) 
Q(M) Q(M) Q(N) 


RGD = 


with remainders estimated uniformly in j < N — 1. For brevity, let 


qy=e"O(n), — gP(ny=e"O% (Qn), @@(n) =" M). 
In our notation, 
ye (n)z" = 4" S alme, 
n=0 
where the two series converge in |z| < 1. Let 


CO 


eid! = se Cm()Z", 


m=0 
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1 


then cn(j) = 0 if j 4m. Moreover, since A; < Cj~’ with an absolute positive 


constant C, we have c,(m) = —Am «m7! ifm > 1 and 
Cey\™/i 1 
‘m /)) = 7S (= ) = 2.6 
|em(a)| = A; in = « KS (2.6) 


provided that m > 1, j|m, j # m and hence j < m/2. Applying the routine 
convolution argument, we firstly split 


an) = ( a as ext datn =f) = 1 + Ea, 
O<k<n/2 


n/[2<k<n 


then using (1.6) and (2.6), we deduce that 


1 ' ; 
_— vere rea! WO Tie v2 Fen! ail ( Fe -—=)+ 0(=s) 
ei log(n + 1) Aj log(n a 1) 
Vian = (2) =e" aio(1 - omen”) 


uniformly inj < n. For the remaining sum we observe 


1 1 1 1 
By KUn/2<j<ni-+ = YO a K«K-. 
J 1<k<n/2 Vk a 
Consequently, 
: -%. 1 
g(n) =e ‘gto)(1 a: o(=)). (2.7) 


The same argument, by virtue of a relation 


gi? (n)z" = ew aid! Me g?) (n)z" 


n=0 n=0 


and the just obtained estimate (2.7), gives 


ij — aay ni i — ahin-aj 
gD(n) =e 4)(1+0(—)) = ei ~ain(1+0(=)). (2.8) 
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Inserting (2.7) for n = M andn = N — il together with (2.8) forn = M —il > M/2 
into R(i,j), we obtain 


RGD = oa ( Sis o()) 


q(M) q(N) M 
eh it 1/2 _ il -1/2 1 
Ke (( —) (i=o) eS 
<K eo 


provided that il < M/2. 
Returning to (2.5), we observe that 


1 
S(M) «1+ yaa “RD 1+ yo i oye 


gene Me 


1 . 
Kl+a ae 


i<M/2 


This completes the proof of Theorem 1.2. 


Concluding Remarks The applied approach still works if a mapping g € Ty is 
sampled according to a weighted multiplicative probability Py ({g}), say, 


Py({p}) = =T1¢ am 


N jn 


where 6; > 0 are constants satisfying some regularity conditions and Cy stands for a 
normalizing sequence. In recent years a few new models of random mappings were 
introduced (see, for example, [6] or more recent papers); however, they do not fall 
into the discussed class. 


Going along the path, analogues of the Turan-Kubilius inequality can be obtained 
for classes of assemblies (see [1] for a definition) generalizing permutations and 
mappings. For the so-called additive arithmetical semigroups, fairly general results 
have been obtained by S. Wehmeier [13]. Estimations of general moments of 
additive functions defined on mappings, similar to that elaborated in number theory 
by P.D.T.A. Elliott, A. Hildebrand, I.Z. Ruzsa, K.-H. Indlekofer and I. Katai, would 
be highly desirable. 
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Abstract Adolf Hurwitz is rather famous for his celebrated contributions to 
Riemann surfaces, modular forms, diophantine equations and approximation as well 
as to certain aspects of algebra. His early work on an important generalization of 
Dirichlet’s L-series, nowadays called Hurwitz zeta-function, is the only published 
work settled in the very active field of research around the Riemann zeta-function 
and its relatives. His mathematical diaries, however, provide another picture, namely 
a lifelong interest in the development of zeta-function theory. In this note we shall 
investigate his early work, its origin, and its reception, as well as Hurwitz’s further 
studies of the Riemann zeta-function and allied Dirichlet series from his diaries. It 
turns out that Hurwitz already in 1889 knew about the essential analytic properties 
of the Epstein zeta-function (including its functional equation) 13 years before Paul 
Epstein. 
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1 Hildesheim: The Hurwitz Zeta-Function 


Adolf Hurwitz was born in Hildesheim on 26 March 1859 into a Jewish family of 
merchants. At the age of 17 he had his first publication in mathematics, a joint paper 
on enumerative geometry with his Hildesheim school teacher Hermann Caesar 
Hannibal Schubert, a well-known geometer of the time. On promotion of Schubert 
young Hurwitz started his studies at the polytech university of Munich! in 1877 
under the supervision of the eminent Felix Klein; soon after he moved with Klein to 
Leipzig where he obtained his doctorate in 1881. 


Being a young doctor, in the winterterm 81/82 he moved to the University of Berlin to 
improve himself by Weierstrass and Kronecker with whom he also had personal contact. 
Weierstrass was in particular interested in his doctoral thesis and his function-theoretical 
efforts, he also proposed the topic for his habilitation thesis.” 


wrote Ida Samuel-Hurwitz, the later wife of Adolf Hurwitz in her dossier [96]. At 
Leipzig first obstacles appeared. According to Rowe: “to pursue an academic career 
meant applying to habilitate, and the Leipzig regulations required applicants to be 
graduates of a humanistic gymnasium. Klein must have realized that this stipulation, 
together with Hurwitz’s Jewish confession, virtually ruled out any chance of his 
habilitating in Leipzig.”, [94, p. 23].? Consequently, Hurwitz moved to Géttingen 
for his habilitation in 1882. 

His first thoughts about the Riemann zeta-function and related questions fall 
into this period. At this time of his career Adolf Hurwitz often visited his nearby 


Called the Kéniglich Bayerische Technische Hochschule Miinchen which had been renamed to 
Technische Universitat Miinchen in 1970. 


°“Fiir das Wintersemester 1981/1982 bezog er als junger Doktor nochmals die Universitit Berlin, 
um sich bei Weierstra8 und Kronecker, zu denen er auch in personliche Beziehung trat, noch 
weiter zu vervollkommnen. Weierstra interessierte sich besonders fiir seine Dissertation und 
seine funktionentheoretischen Bestrebungen, auch gab er ihm das Thema fiir die Habilitation 
daselbst...”, [96, p. 6]; this translation from the original German quotation to English as well 
as those in the sequel are due to the authors; we have neither corrected the German language to the 
grammar used nowadays nor have we tried to translate into old-fashioned English. The mentioned 
university is the Humboldt University, before 1949 called Friedrich-Wilhelms-Universitat. 


Even stronger is a statement of Ida Samuel-Hurwitz about anti-Semitism at Leipzig University 
during Hurwitz’s lifetime: “Obgleich sein mangelhafter Gesundheitszustand natiirlich bekannt 
war, trachtete man in Deutschland mehrfach, ihn wieder dorthin zu ziehen. Als gleichwertig 
mit seinem Ziircher Lehrstuhl, den nacheinander eine Reihe der hervorragendsten Mathematiker 
bekleidet hatten, konnten freilich nur Berlin, Miinchen und Leipzig in Frage kommen. An die letzte 
Universitat war die Berufung eines Juden ausgeschlossen; fiir Berlin war er bei Vakanz an erster 
Stelle vorgeschlagen. Von Gottingen war natiirlich 6fters die Rede, doch scheute er einerseits den 
mathematischen Betrieb dort, wahrend ihm andererseits die Kleinstadt unsympatisch war.” [96, 
p. 11]. English translation: “Although his defective health was known there were attempts to get 
him back to Germany. As equivalent to his Zurich chair only Berlin, Munich and Leipzig could 
be considered. To fill a vacant professorship with a Jew at the latter university was impossible; for 
Berlin he was recommend for the first place in case of a vacancy. Of course, Gottingen was often 
under discussion, but on one side he disliked the mathematical business there, on the other side he 
considered the small town unsympathetic.” 
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Fig. 1 Adolf Hurwitz (1859-1919) in between his teachers Felix Klein (1849-1925) on the /eft 
and Karl Weierstrass (1815-1897) on the right. The photographs seem to be taken around 1881. 
Besides Klein, Hurwitz had been significantly influenced by the Berlin school 


hometown Hildesheim. From several letters to his friend from his studies at Munich 
University (and later renown analyst) Luigi Bianchi one can get interesting insights 
about this period. In a letter from 25 May 1882, Adolf Hurwitz had already finished 
his habilitation and had been working as a private docent in Géttingen, he wrote to 
Bianchi: 

Instantly I am visiting home for Pentecost which I can allow myself since Hildesheim can 

be reached by train from Gottingen within 2 hours (...) In Gottingen, I live a pleasant life. 

Prof. H.A. Schwarz is very kind to me and Dr. v. Mangoldt, who is a privat docent for 


mathematics as well, is a very noble character. Furthermore, in math at Gottingen there is 
Prof. Stern and Prof. Schering.* 


In particular Hans von Mangoldt (1854-1925) is interesting here. He played an 
important role in zeta-function theory and its application to questions about prime 
numbers; his contributions in the 1890s turned out to be essential for the first proof 
of the prime number theorem in 1896 (details in Sect. 2). Here, the Riemann zeta- 
function 


C(s):= Don = 14254354454... 


n>1 


is building the bridge between primes on the arithmetical side and the powerful 
tools of complex analysis on the other side. It was young Adolf Hurwitz who 


4“Augenblicklich bin ich auf Pfingsten zu Besuch zu Hause, was ich mir um so eher erlauben kann, 
als Hildesheim von Gottingen aus in 2 Stunden per Eisenbahn zu erreichen ist. (...) In Géttingen 
fiihre ich ein sehr angenehmes Leben. Prof. H.A. Schwarz ist sehr nett gegen mich und Dr. v. 
Mangoldt, welcher ebenfalls Privatdoc. fiir Mathematik ist, ist ein sehr nobler Character. Ferner 
sind von Mathemat. in Gottingen noch Prof. Stern und Prof. Schering.” 
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invented already in 1881 a simple but important generalization, the nowadays so- 
called Hurwitz zeta-function, given by 


C(s,a) = D(m+a)™, (1.1) 


m>0 


where @ is a real parameter satisfying 0 < a < 1. Actually Hurwitz just considered 
the case of rational a which is sufficient for applications to number theory; however, 
for most studies general real numbers a do not cause big trouble. We notice that the 
series representation from above is absolutely convergent for Res > 1, uniformly 
in any compact subset. 

We believe that the years 1881-1884 played a very important role in Adolf 
Hurwitz’s life. Of course, he did his doctorate and his habilitation in this period, and 
in addition he started his important investigations on several topics different to the 
themes of his doctoral and habilitation theses (on modular forms). With his studies 
of the Hurwitz zeta-function as well as of real and complex continued fractions 
he emancipated from his advisor Klein (as well as from his mentor Weierstrass in 
Berlin) and became an independent mathematician. Concerning continued fractions 
this is well documented in Oswald and Steuding [78]. In this chapter we focus on 
Hurwitz’s work on the Riemann zeta-function and its relatives. 


1.1 Prehistory: From Euler to Riemann 


The infinite series 1 + 2-* + 3°* +... defining the zeta-function is linked with the 
name Leonhard Euler in many ways. Starting from 1735, he gave various proofs 
of his famous formula ¢€(2) = — in 1743 he found explicit values for €(s) at the 
positive even integers [24], namely 

(27)** Box 


B d ee 
Tape «and SL 2k) = aE 


£(2k) = (-1)"! 


both valid for integral k, although Euler had no convergent representation for €(s) 
when s is a negative real. The numbers B,, are the Bernoulli numbers. Concerning 
the functional equation linking the values ¢(s) and ¢(1 — s), Euler claimed that 
there is no doubt about the truth of this conjecture [25].> Most importantly, although 
elementary, in 1737 Euler [23] discovered the analytical analogue of the unique 
prime factorization in terms of ¢(s) by observing 


6s) = Son’ =[Ja-p sy". 


n>1 peP 


>Euler’s approach has led to a powerful tool in theoretical physics, called regularization; it was 
Stephen Hawking [42] in 1977 who was the first to use the Riemann zeta-function by this method 
in order to compute certain quantities in quantum field theory. 
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valid for Res > 1 (the half-plane of absolute convergence for both, the series and 
the product), where P denotes the set of prime numbers. The appearing product 
is named after Euler and had been used by himself to deduce from the divergent 
harmonic series (appearing as €(s) with s > 1+) the existence of infinitely many 
primes by his famous formula 


Eds aed k log 
> - - _ ».. = 102102 CO 
a° 9°39 a" 


(which nowadays is written as neeper ; ~ log log x).° 


The function ¢(s), however, is named after Bernhard Riemann who was the first 
to study the behavior of ¢(s) as a function of a complex variable in his path-breaking 
nine pages article [93] written at the occasion of his admission to the Prussian 
Academy of Sciences in 1859. He proved analytic continuation for ¢(s) to the whole 
complex plane except for a simple pole at s = 1 and Euler’s conjectured functional 
equation’ 


ee a (5) f(s) = na (- =) cUl=5); 


Indeed, Riemann gave two rather different proofs of the functional equation, the 
first one by use of a certain contour integral (which will be the main topic of this 
section when we shall analyze Hurwitz’s paper) and another one based on Poisson’s 
summation formula and Jacobi’s transformation formula for the theta-function.® It 
follows from the functional equation and basic properties of the gamma-function 
that the nontrivial zeros (1.e., the non-real zeros) are symmetrically distributed with 
respect to the real axis and the so-called critical line Res = 5: However, Riemann’s 
paper [93] is probably more important with respect to its not rigorously proved 
statements and conjectures. Nowadays this very memoir has been considered as a 
signpost for developing complex analytic methods in order to reveal distribution 
properties of the prime numbers. 


Analytic number theory may be said to begin with the work of Dirichlet, and in particular 
with Dirichlet’s memoir of 1837 on the existence of primes in a given arithmetic 
progression. 


This quotation is the first sentence in Davenport’s monography on multiplicative 
number theory [13], referring to the celebrated work of Johannes Peter Gustav 


®Tt should be noted that Euler used slightly different notation. For more details on Euler’s work we 
refer to Weil [112]. 

7According to Landau [58], the only other mathematician to have noticed Euler’s work on this 
topic had been Cahen [9]. This statement is refuted by Malmstén [70], nevertheless some of Euler’s 
works were probably not easily accessible in this time. Only the era of mathematical journals made 
an easy exchange of mathematical ideas possible. 

8Tt was shown by Hamburger [38] that a functional equation of the Riemann-type is equivalent to 
both, the Poisson summation formula and a theta-function transformation formula. 
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Fig. 2. From /eft to right the pioneers of analytic number theory: Leonhard Euler (1707-1783), 
Peter Gustav Lejeune Dirichlet (1805-1859), and Bernhard Riemann (1826-1866). Beards seem 
to have been pretty much in fashion in the nineteenth century 


Lejeune Dirichlet on the distribution of primes in arithmetic progressions, resp. 
prime residue classes around 1837, about one generation before Riemann. In fact, if 
one seeks for primes in the residue classes modulo 4, one observes a certain uniform 
distribution of the primes in the prime residue classes: 


4Z+1:...15 9 1317... 
4Z+2:...26101418...< just one prime 
4Z4+3:...3711 1519... 
4Z+0:...4812 1620... <no prime at all 


Notice that Euclid’s celebrated proof of the infinitude of primes cannot be extended 
to prove the existence of infinitely many primes in an arbitrary prime residue class 
amod m; actually, as proved by Ram Murty (see [76] for a recent presentation) 
such an elementary proof is possible if and only if a7 = 1 mod m. A first attempt 
for proving an infinitude of primes in a prime residue class was given by the 
unfortunate Adrien Marie Legendre [64]; however, his proof was erroneous, and 
this was pointed out by Dirichlet [18] himself. Before we outline Dirichlet’s work on 
primes in arithmetic progressions we shall briefly mention the circumstances under 
which Dirichlet wrote his landmark paper [18] on this topic (building on Elstrodt’s 
biography [21]). 

His given name Lejeune means literally the young and his surname Dirichlet 
provides information about the origin of his family being from Richelette, a small 
town in Belgium.? His school and university education benefited a lot from the 


There is something funny about his birthplace: Dirichlet was born in Diiren, in between Aachen 
and Cologne, and the house his mother gave birth is Weierstrafe 17; notice that Strafe is German 
for street and the mathematician and contemporary Karl Weierstrass is written Weierstraf in 
German. 
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reform of the Prussian educational system implemented by Wilhelm von Humboldt. 
In 1828, his brother Alexander von Humboldt used his influence to order the young 
Dirichlet to Berlin.!° Starting from 1831 Dirichlet had a research position at the 
University of Berlin!’ and in the following year he was elected member of the 
Prussian Academy of Science. In the same year he married Rebecka Mendelssohn, a 
sister of the composers Fanny Hensel (the grandmother of the number theorist Kurt 
Hensel) and Felix Mendelssohn Bartholdy.'? For the last 4 years of his short life 
Dirichlet inherited Carl Friedrich Gauss’ chair at the University of Géttingen. 

One of his major results is the proof of the existence of infinitely many prime 
numbers in arbitrary arithmetic progressions a + mZ with (of course) coprime a 
and m. The main idea is the use of the so-called Dirichlet characters, i.e., group 
homomorphisms y : (Z/mZ)* — C* in order to sift a prime residue class a mod 
m. Extending these characters to arithmetical functions n +> y(n) defined on N (by 
letting y(n) = 0 for n not coprime with m), he defined associated Dirichlet series by 


Le, 0 =| [G-x@p")! = Do a@n: 


peP n>1 


the representations converge for Res > 1 absolutely and the identity between the 
product and the series is (as in the case of the Riemann zeta-function which appears 
essentially as L(s, ¥o) with the principal character yo being constant one) nothing 
but the unique prime factorization in analytic disguise. Using this in combination 
with the orthogonality relation for characters, one finds 


| 7 
ps Fi cos Tap (98509) + 2. Y(a) log L(s, 0): 
XFXO 


Now the divergence of ¢(s) for s > 1+ and the regularity and non-vanishing of the 
values L(s, y) with y # Zo at s = 1 yield the desired statement. 

Dirichlet’s approach is rather explicit. In the case of prime moduli m = p his 
characters are defined by primitive roots and therefore his reasoning for showing 
LU, x) # 0 is first limited to such restricted arithmetic progressions. The first 
appearance of characters can be found in Gauss’ work on cyclotomy [30], the 
modern theory of characters starts with Richard Dedekind and his supplements to 
Dirichlet’s lectures on number theory [20], and the general theory of characters (also 
for finite nonabelian groups) and group representations is due to Georg Frobenius 
[28]. For the general and more advanced case of composite moduli Dirichlet built 
his reasoning on binary quadratic forms aX* + bXY + cY*. For the number h(D) of 


'0See [6] for their intensive exchange of letters. 
"Once again the Humboldt University, formerly Friedrich Wilhems University. 


The additional surname Bartholdy had been added to the Jewish surname with respect to the 
Christian education of the children, a rather common assimilation in these anti-Semitic times. 


316 N. Oswald and J. Steuding 


Fig. 3 From /eft to right: Carl Johan Malmstén (1814-1886), Ferdinand Gotthold Max Eisenstein 
(1823-1852), and Oscar Xaver Schlémilch (1823-1901) 


equivalence classes of these forms with fixed discriminant D = b? — 4ac he proved 
the celebrated analytic class number formula [19],!* for example 


n(D) = VDL. xo) 


for the case D < 0 of positive definite forms, where w denotes the number of units 
in the ring of integers of Q(./D) and yp is a quadratic character defined by the 
Kronecker symbol yp(n) = (2). Since h(D) > 1 this proves L(1,yp) 4 0, 
and another subtle reasoning shows that also the non-quadratic characters do not 
cause any trouble. Dirichlet’s reasoning is precise but more complicated than 
necessary since he used the Jacobi symbol in place of Kronecker’s symbol. Anyway, 
“Dirichlet’s papers, as outstanding for mathematical rigor as Euler had been careless 
about such matters, ...” wrote Weil [112, p. 7]. The first completely analytic proof 
of the non-vanishing of L(1, x) is due to Franz Mertens [73] and uses the Dedekind 
zeta-function associated with quadratic number fields. 

Often Schlomilch is given credit for the first significant contribution to the 
analytic theory of L-functions beyond Dirichlet’s paper. Oscar Xaver Schlomilch 
was a pupil of Dirichlet; in 1845 he was appointed a professorship at the University 
of Jena. In 1856, he founded the Zeitschrift fiir Mathematik und Physik, also known 
as Schlémilchs Zeitschrift, and used this journal to publish several of his results.'* 


'31t has been observed by Minkowski [74, p. 456] that already in Gau8’ estate one can find a note 
from 1801 about the class number formula about 40 years previous to Dirichlet, however, a proof 
is missing; moreover, the Disquisitions [30, p. 369 and 466] contain formulae for sums of class 
numbers without a proof which was first delivered by Siegel [107]. 

'4This periodical was only the third one published in Germany after the renowned Crelle journal 
established in 1826 and run by the Berlin mathematical school, and the less known Archiv fiir 
Mathematik und Physik founded in 1841 by Johann August Grunert in Greifswald. According to 
Koch [56], the reputation of Schlémilch’s journal was not very good. 
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In 1849, Schlémilch announced in [100] the proof of the functional equation 


L(l—s) = (=) I(s) sin(2)L(s), 


where L(s) = 1—3°° +5-*—77-* +... is the Dirichlet L-function to the non- 
principal character y mod 4; the proof, however, was published only in 1858 in his 
journal as [101] although the title of his announcement [100] is literally “Exercises 
for pupils concerning a theorem due to Prof. Dr. Schlémilch.” In 1878, he was the 
first to give a proof of the functional equation for ¢(s) differing essentially from 
Riemann’s methods [102]. Notice that Schlémilch was active in educational politics 
serving from 1874 until 1885 as consultant for the Saxonian Ministery of Education. 
For more details to his life we refer to Cantor’s obituary [10]. 

However, it was Carl Johan Malmstén being the first to publish a proof of a 
functional equation for a Dirichlet L-function. Interestingly, Malmstén’s biography 
shares quite many similarities with Schlémilch’s. Since 1842 Malmstén was 
professor at the University of Uppsala, and he was among the founders of the 
renowned journal Acta Mathematica. He was active as rector of his university 
as well as privy council and member of the Swedish Reichstag (parliament) for 
the period 1867-1870. He is famous for his contributions to the Swedish old- 
age pension insurance and he was member of the Royal Prussian Academy of 
Sciences as well as the Gottingen Academy of Sciences; actually, his daughter Maria 
Heliodora was married since 1876 to the Gottingen mathematician Ernst Christian 
Julius Schering. In 1849, Malmstén [70] published a proof of the functional equation 
for L(s) independent of Schlémilch; moreover he announced that a similar reasoning 
would prove a corresponding result for 


Gd —2')e(s) = 1-2°+3%-4%4+.... 


Actually, Malmstén’s paper bears the date “Upsala lst May 1846.” It might also be 
interesting to note that he used already the variable s as Riemann later, which might 
indicate that Riemann was aware of Malmstén’s contribution. 

But the story before Riemann’s memoir appeared is not yet complete. It was 
Ferdinand Gotthold Max Eisenstein who gave another proof of a functional equation 
for a Dirichlet L-function and this might have been inspiration for Riemann, too. 
Eisenstein was another pupil of Dirichlet and member of the Berlin and Gottingen 
Academies of Sciences, the latter one on promotion by Gauss. It was Andre Weil 
who found in 1989 in Eisenstein’s copy of Gauss’ Disquisitiones Arithmeticae on 
the last and blank page a hand-written proof of the functional equation for L(s) 
bearing the date “Scripsi 7 April 1849” (cf. [112, 113]), the time when Riemann left 
Berlin for Géttingen. Eisenstein’s proof is based on Poisson’s summation formula 
as the second one given by Riemann. It is well known that Riemann was attending 
Eisenstein’s lectures on elliptic functions in Berlin in the summer 1847 but it seems 
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Fig. 4 Thomas Clausen (1801-1885), Hermann Kinkelin (1832-1912), and Wilhelm Scheibner 
(1826-1908). Nowadays only the astronomer and mathematician Clausen is still well known. 
Beards again: two out of three 


that their relation was not too good.!° Why did Riemann give two different proofs 
for the functional equation? Of course, we may only speculate, but a possible answer 
could be to convince himself about the truth of the result; a further reply could be 
to stress the importance, and yet another possibility could be awareness of such a 
result (e.g., Eisenstein’s proof). 

In 1858 Thomas Clausen [11], and in 1862 Hermann Kinkelin [55] gave 
alternative proofs for the functional equation for L(s) and generalizations thereof, 
respectively. Clausen’s work [11] is related to his studies of Bernoulli numbers. 
Kinkelin was a Swiss mathematician working essentially in analysis until 1865; 
later he focused on didactics and mathematics for insurances. In 1865, he was 
appointed a professorship at the University of Basel.'° Also Kinkelin was politically 
active (for a conservative party), and in 1886 he was elected Senior Civil Servant 
(see Schaertlin [97] for details). His studies [55] from 1862 include a proof of the 
functional equation for L(s, x) for characters y mod q with q being a prime power 
or a squarefree odd integer. Kinkelin is not often cited; we can find a note of his 
work in Landau’s Handbuch [59].'’ Though no comment about the arithmetical 
implications of Riemann’s paper is given. It might be that some people of that time 


'SThe quotation “Eisenstein stopped at formal computation” (resp. in German: “Eisenstein sei bei 
der formellen Rechnung stehengeblieben”’) is credited to Riemann; cf. Laugwitz [63, p. 21]. 
!6Where around 1896 also Julius Hurwitz, the less famous brother of Adolf Hurwitz, was lecturing 
for a couple of years; see Oswald and Steuding [78] for details of Julius’ interesting life and career. 
Tt is interesting to read Kinkelin’s comment on what is nowadays called the Riemann hypothesis: 
“The last two equations seem to be satisfied simultaneously only if its root s is of the form 
1/2+t /—1; indeed a rigorous proof of this remark has not been given so far (cf. Riemann)”; the 
German original is: “Die beiden letzten Gleichungen scheinen nur dann gleichzeitig nebeneinander 
bestehen zu kénnen, wenn ihre Wurzel s von der Form 1/2 + tJ/—1 ist; indessen ist der strenge 
Beweis dieser Bemerkung noch nicht gelungen (cf. H. Riemann a.a.O.).” [55, p. 18]. 
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considered Riemann’s results already as the solution of Gauss’ conjecture about the 
number of primes below a given magnitude. Kinkelin used the notation K(s) for 
€(—s) and was very much interested in the relation between Bernoulli numbers and 
the special values of the zeta-function at the integers. There are further contributions 
to be mentioned here. Wilhelm Scheibner [99] was one of the first to analyze 
Riemann’s work; besides he succeeded to establish the case g = 12. In 1857 Rudolf 
Lipschitz [68] gave yet another proof of the functional equation for L(s), however, 
we shall consider his results later in another context. 

Riemann [93] mentioned previous works by Euler and Pafnuty Chebychev (with 
respect to his elementary approach), but he did not mention Legendre, Eisenstein, 
Malmstén, Schlomilch, or anybody else. We cannot be sure whether he knew about 
their results or not. Definitely, Riemann’s insight was much deeper and his methods 
were far beyond what his contemporaries had at hand. 


1.2 The Hurwitz Identity 


Adolf Hurwitz’s article [46] from 1881 starts as follows: 


(...) achieved general formulae of the same simplicity as those found by Schlémilch and 
Riemann, which appear as particular cases. These general formulae might be of special 
interest with respect to their relation to functions playing a substantial role in Dirichlet’s 
deep investigations concerning the number of classes of binary quadratic forms and prime 
numbers in an arithmetic progression.'® 


Notice that [46] was published in Schlémilchs Zeitschrift. Therefore, it is natural 
to mention Schlémilch’s contribution besides Riemann and Dirichlet, however, in 
the whole paper neither references to Malmstén, Eisenstein, nor Kinkelin can be 
found. 

The 22 years old young doctor Adolf Hurwitz signs his paper with the date 
“Hildesheim, den 10. Oktober 1881” although his scientific address at that time was 
Berlin. The letter exchange from this period with his friend Luigi Bianchi is very 
interesting for several reasons. They give the picture of two extraordinarily gifted 
mathematicians who kept their friendship despite the geographical distance between 
each other. 


Infinitely I have enjoyed to have been welcomed by your letter; your last letter with your 
photography (the people are stupid, they go and let them photographed) I have received 
too.!? 


18«(.) gelangte ich zu allgemeinen Formeln, die von gleicher Einfachheit sind, wie die von 
Schlémilch und Riemann gegebenen, und dieselben iiberdies als spezielle Falle enthalten. Diese 
allgemeinen Formeln diirften um so mehr ein Interesse beanspruchen, als sie sich auf eben die 
Functionen beziehen, die bei den tiefen Untersuchungen Dirichlet’s iiber die Anzahl der Klassen 
binadrer quadratischer Formen und tiber die in einer arithmetischen Reihe enthaltenen Primzahlen 
eine wichtige Rolle spielen.”, [46, p. 73/74]. 

'<Unendlich habe ich mich gefreut mit einem Briefe von Dir empfangen zu werden; auch 
Deinen letzten Brief mit Deiner Photographie (Die Leute sind dumm, sie gehen und lassen sich 
photografiren) habe ich erhalten’, [4, p. 77]. 
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Fig. 5 On the /eft Luigi Bianchi (1856-1928) who made important contributions to differential 
geometry (this is probably not the photograph he sent to Hurwitz). On the right a part of his letter 
to Hurwitz with date 28 August 1881 [3] 


wrote Hurwitz to Bianchi in a letter with date 23 December 1881, [4, p. 77]; a later 
letter of Hurwitz to Bianchi from 6 April 1883 must have contained a photograph 
of Hurwitz, too; [4, p. 91]. They continued writing letters to each other their whole 
lives, however, it seems they never met again in person. Of particular interest are 
mathematical topics in their correspondence. In a letter to Hurwitz with date 28 
August 1881 Bianchi wrote (see Fig. 5): 


I am grateful for your function-theoretical note. If I am not wrong the path to generalize 
these results is in considering Dirichlet series 


where n runs through all prime numbers with respect to D. For D = 2 one gets 
Schlémil(ch)’s series and for D = 3 one gets yours.”° 


Unfortunately, the previous letter of Adolf Hurwitz got lost; however, in [4], 
Opere, Vol. XI: Corrispondenza, one can read Hurwitz’s reply to Bianchi from 24 
September 1881: 


Your serendipity has found the correct generalization; however, I have had the same idea a 
few days after sending you my previous letter on my own, since the easy treatable case of 
a prime number had necessarily guided me. Today I shall inform you about the complete 
results. .. 7! 


0In the subsequent paper [79] the authors investigated the Hurwitz estate further and found 
interesting details concerning Hurwitz’s approach to zeta-functions associated with quadratic 
forms. 

21Dein Spiirsinn hat die richtige Verallgemeinerung getroffen; ich hatte allerdings auch ein paar 
Tage nach der Absendung meines vorigen Briefes dieselbe Idee, da mich der leicht zu behandelnde 
Fall einer Primzahl mit Nothwendigkeit darauf fiihrte. Ich theile Dir nun heutige die fertigen 
Resultate mit...”, [4, p. 74]. 
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Luigi Bianchi was a student of Enrico Betti and Ulisse Dini; he received his 
doctorate already in 1877. Afterwards he continued his studies at the University of 
Munich where he became a friend of Adolf Hurwitz. Their correspondence gives 
indeed a lively picture of a deep and lifelong friendship and it seems there was no 
competition between the two young stars. 

We shall briefly outline Hurwitz’s proof (using more or less his notation). First 
of all, given integers | < a < m, he defines 


f(s,@ :=a* + (atm) *4+ (at+2m%+...= n°; 


n=a mod m 


in the now common notation (1.1) this Dirichlet series equals m~*°C(s, a/m), and we 
recover the Riemann zeta-function as €(s) = f(s, 1) in the case m = 1. Building on 
Riemann’s first proof of the functional equation, Hurwitz starts with Euler’s integral 
representation of the gamma-function, 


F(s) = / ~ | exp(—s)dt, 


valid for Res > 0. Substituting t = nx and summing up over n = a mod m, he gets 


I'(s) > n= » [PO -exo-msya 


n=a mod m n=a mod m 


By uniform convergence, interchanging summation and integration is allowed; the 
resulting sum of exponentials is a geometric series with value 


exp(—ax) 


2 exp(—nx) = > exp(—(km + a)x) = ; 


n=a mod m k>0 


It might be interesting to notice that already in Dirichlet’s approach [18] one can 
find a comparable integral representation for the product of a Dirichlet series with 
the gamma-function. Actually, Hjalmar Mellin [71, 72] developed a theory for 
such integral representations coining the notion of a Mellin transform. It might 
be interesting to notice that in 1881/82 Mellin studied at the University of Berlin 
(cf. Lindelof [67]) so he might have met Hurwitz in person when he was starting his 
investigations of the Hurwitz zeta-function. Previous to Mellin it was Eugene Cahen 
[9] who studied for his doctorate such transformations, however, his dissertation 
had been criticized for its many errors by Adolf Hurwitz in his review [48] for the 
Jahresberichte; it was later corrected by Oskar Perron [80] (who also removed some 
mistake of Kronecker [57]). 
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Betrachten wir jetzt in der Ebene der komplexen Variabeln 2 ein 
Rechteck, dessen Seiten der Axe der rein imaginiren Zahlen, resp. der 
Axe der reellen Zablen parallel Jaufen und dessen Mittelpunkt der 
Punkt « = 0 ist (Fig. 2), so wird im Innern desselben die Funktion 


eee: 
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Fig. 6 A picture from Hurwitz’s article [46] showing a drawing of the contour used in the proof 
of the Hurwitz identity 


We continue with Hurwitz’s proof. It follows from (1.2) that 


 exp((m—a)x) ._) 


rofe.a) = | dr. 


exp(mx) — 1 
Now define the integral” 


exp((m — a)x) 
exp(mx) — 1 - 


’ 


166) = f exp(— 1) 1og(—») 


where the contour C is the path starting from +o along the real axis in the upper 
half-plane to a real point A > 0, then counterclockwise continuing on the circle 
o of radius A around the origin and then back to +00 along the real axis in the 
lower half-plane; we may assume that A is so small that the only singularity of the 
integrand inside C is at the origin, and that there is no singularity on C. For positive 


?2Such integrals are now called Hankel integrals in honor of Riemann’s pupil Hermann Hankel 
who used them in his studies [39] of the Gamma-function (although already Riemann [93] used 
them earlier). 
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x the imaginary part of the logarithm log(—x) is equal to —iz on the positive part of 
the real axis and equal to +-iz on the negative part. Consequently, 


, exp((m — a)x) 


dx 
exp(mx) — 1 7 


J(s) = [ ex-ins) — exp(ims))x*— 


exp((m — a)x) 


+ [ exp((s— 1) log(—x) 


The integral over the small circle o tends for Re s > 1 to zero as A —> 0, giving 
jim J(s) = —2isin(as)I'(s)f(s, a). 


Using the formula 


I(s)rd-s) = 


sinzs’ 
Hurwitz deduces 


f(s,a) = ra — s) lim J(s). 


Since the integral lim4—+o J/(s) converges uniformly on compact sets it defines an 
entire function. Consequently, the only possible singularities of f(s, a) are the poles 
of (1 — s) at the positive integers; however, since f(s, a) is obviously regular for 
s = 2,3,..., it only remains to consider the local behavior at s = 1. In view of 
I'(z+ 1) = zf'(z) (which was already known to Euler) one has 


PU Ss) Ss, 
s—1 
and in combination with 


me i ee de 
tim say = [ Go rer ace 


it follows that f(s, a) admits an analytic continuation to C except a simple pole at 
s = | with residue 1/m. Ina similar way Hurwitz deduces that the values f(s, a) for 
s = —2,—3,... are rational functions in a and m, e.g., 
a(m — a)(m — 2a) 
f2,a = ge 
m 


In modern terms this formula is a special case of the formula 


1 
¢(—n, *) = a Bet) (= m"f(—n, a) ) 
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with the Bernoulli polynomials B,(X) defined by the expansion 


t texp(tX) | , => B, we, 


exp()— 1 & 


giving B3(X) = X?— 3X? + 5X in particular. This as well as much of Hurwitz’s 
investigation of the functional equation in modern form can be found in Lang’s 
books [61, 62]. 

Hurwitz’s next aim is the proof of a functional equation for his function f(s, a). 
For this purpose the integral J is evaluated in a second different way. In place of 
C where the integral was taken over a circle of radius A centered at the origin the 
new path is a counterclockwise oriented rectangle with vertices (2N + 1)=(41 +i) 
which includes the simple poles of the integrand at x = ~~ with integers n such 
|n| < N and excluding all others. Thus the new integral differs by the old one by the 
sum over all residues resulting from these poles except the one at the origin. It is no 
difficulty to compute the residues for +n as 


2min 


1 (2nn\*' 
— | — exp(+F Big 1) = 2nit). 
m 


m 


Now let Res < 0. Since then the integrals over the edges of the rectangular paths 
vanish as N — oo, Hurwitz arrives at the equation 


f(s,a) = = (=) P(L—s) 5° So (mk + by! cos($(s — 1) + 20). 


b=1 k=0 
Finally, after replacing s by 1 — s, this leads to the celebrated Hurwitz formula 


m 


l-s 
f(l—s,a) = - (=) I'(s) DOr — 3)f(s,b). (1.3) 


By analytic continuation the latter formula holds for all values of s. 
Using this powerful identity Hurwitz investigated next his main target, that is the 
Dirichlet series 


F(s,D) = >. (>) n~, 


n>1 


where n +> (2) is the Jacobi symbol associated with a squarefree positive integer 
D = 1 mod 4 and the summation is over all odd 1 coprime with D; for the cases 
D = 2 or 3 mod 4 a harmless factor has to be multiplied, however, for the sake 
of brevity we shall consider only D = 1 mod 4 here. Using a decomposition of 
F(s, D) into his functions f(s, a) with m = |D| and exploiting quadratic reciprocity, 
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Hurwitz proves that with the only exception D = | the function F(s, D) is an entire 
function satisfying the functional equation 


F(-—s,D)= (2) FET FD), 


2 

In principle, his reasoning is a simple application of the representation 
L(s, x mod m) = > x(af (s, a) 

amod m 


in combination with his identity (1.3) and explicit formulas for Gauss sums to 
primitive characters y mod D, namely 


Y> x(a) exp (=) = f0-xCD) p}, 
D 


amod D 


which appear first by Dirichlet [18] (continuing Gauss’ work [30]). Hurwitz’s rea- 
soning is actually slightly more complicated as it is nowadays presented in textbooks 
since the notion of primitive characters was at his time not well formulated. As 
a matter of fact primitive characters have been used by Edmund Landau in his 
1909 monography [59]; for some reason (see ‘Quellenangaben’ in [59, p. 898]) 
his reference for that is an article by Lipschitz [69] from 1889 (more precisely he 
refers to pages 142-144), however, as already pointed out by Narkiewicz [77], the 
characters considered by Lipschitz are primitive but Lipschitz did not invent the 
distinction in between primitive and imprimitive characters. Whereas Hadamard did 
not consider the case of arithmetic progressions in his proof of the prime number 
theorem de la Vallée Poussin worked with characters, however, there is no notion of 
a primitive character in his works (since his approach does not rely on the functional 
equation). A Dirichlet Z-function to an imprimitive character does not satisfy a 
functional equation of the Riemann type. 

We conclude this paragraph with another quotation from the letter Hurwitz sent 
to Bianchi on 24 September 1881: 


My social life is very active thanks to music. I accompany my singing female friends and 
play with them (you don’t need to think anything bad). Recently, we played eight-handed 
with two pianos: three women and myself. I don’t know the article of Lipschitz and cannot 
receive it from here. From Berlin I can tell you about its contents if it is still of any value 
for you.” 


Their correspondence [4] contains two further notes on this topic, the first one 
is a variation of Dirichlet’s class number formula Hurwitz found (and reported to 


3“Mein gesellschaftliches Leben ist durch die Musik ein sehr reges. Ich begleite meine Freundin- 
nen zum Gesang und spiele mit Ihnen (Du brauchst nichts Schlechtes zu denken). Neulich haben 
wir sogar 8-handig auf 2 Claviren musicirt: 3 Damen und ich. Den Aufsatz von Lipschitz kenne 
ich nicht und leider kann ich mir denselben hier auch nicht verschaffen. Von Berlin aus kann ich 
Dir tiber denselben berichten, wenn es dann fiir Dich noch Werth hat,” [4, p. 76]. 
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Bianchi in a letter from 23 December 1881) and the second is about the delay in the 
publishing process (in another letter with date 20 March 1882). Not investigating 
what Lipschitz had done is juvenile omission. In the following we shall consider the 
contributions made by Lipschitz and some other contemporaries. 


1.3. Aftermath and Further Generalizations by Lipschitz, 
Lerch, and Epstein 


Rudolf Lipschitz was a student of Dirichlet. In 1864 he became professor in Bonn. 
He had been considered as one of the leading analysts of his time. In his first 
note on Dirichlet series [68] (and one of the first of all his papers) from 1857 
Lipschitz studied the real and imaginary part of the ¢(s) defining Dirichlet series 
and generalizations thereof, namely 


+00 
Y> exp(2xV—1v)(n + u)V-1 +k): 


n=—oo 


he obtained integral representations and identities for these infinite series. Here o is 
assumed to be a real variable; in that sense Lipschitz gave birth to the funny mixture 
of writing s = o + it with Greek and Latin characters for the complex variable 
(which is often attributed to Landau; the imaginary part t stems from Riemann’s 
paper), only the use of the letter “i” for the imaginary unit /—1I cannot be found 
in his papers. With his second article [69] from 1889 Lipschitz succeeded to give a 
more or less complete account of Dirichlet L-functions within an analysis of a more 
general Dirichlet series. Following Riemann’s first proof, Lipschitz [69] obtained 
the identity 


_ exp(—2ziv(n + u)) ge exp(2zinu) 
r(S)y eee arg) eee 
(1 + u)’n) n=—oo (n+ vm)? 


n=—oo 


valid for Res € (0,1). Using Gauss sums, he deduced the functional equation 
for quite a few Dirichlet L-functions; his reasoning is rather general. Later, Vallée 
Poussin [17] found another simplified proof for the functional equation of Dirichlet 
L-functions; the method of proof relies on Riemann’s second proof of the functional 
equation for ¢(s). For this purpose Vallée Poussin proved a character version of the 
Jacobi theta-function transformation formula. His proof became standard, appearing 
first in Landau’s Handbuch [59] (see also Davenport [13]). 

Besides Hurwitz, Lipschitz gave credit to Mathias Lerch”* [65] who studied two 
years earlier the infinite series which is nowadays well known as the Lerch zeta- 


*4Or Matyas Lerch in Czech; Lerch started school only at the age of 9 years because of a severe 
injury of his left leg, a handicap he suffered for the rest of his life. For details of his unfortunate 
career and difficult character we refer to Porubsky [92]. 
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function, namely (in modern standard notation) 


L(A, a,s) = > exp(27iAm)(m + a)~* 


m>0 


with real A and @ (so that L(1,a,s) = €(s,a) with a € (0, 1]). For this function 
Lerch obtained analytic continuation to the whole complex plane except a simple 
pole at s = 1 in the case of integral A and he derived a functional identity of the 
form 


LO,a,1—s) = (21) *I'(s)(exp( — 2miadA)L(—a, A, 5) 


+ exp(— + 2mia(l — A))L(a,1—A,5)), 
valid for A € (0,1),@ € (0,1], and Res € (0,1). This formula includes the 
Hurwitz formula as limiting case A — 0+. It is interesting that Lerch even allowed 
complex values for A and s. It seems that complex numbers and methods were 
widely accepted in Lerch’s generation but less in the former. His reasoning is also 
based on Riemann’s first approach to the functional equation. As particular case one 
deduces a result due to Lipschitz [68] and Kronecker [57], namely the identity 


3 exp(27imd) — exp(2m iad) 

—§_|_ = 2x1 -————_—.. 
mong «6 1 — exp(ziad) 
Lerch mentioned previous works by Malmstén [70] and Lipschitz [68]; moreover, in 
a later paper [66] he even called the latter series the Lipschitz function.> Malmstén 
investigated the even more general series 
3 exp(2zrinu) 

(n+ u)? + v?)3” 


n=—oo 


which has some similarity with the so-called Epstein zeta-function associated with a 
quadratic form. As a matter of fact, already in a letter to Bianchi from 1881 Hurwitz 
had mentioned these series”: 


5In 1889, Alfred Jonquiere [54] obtained the analogue of Hurwitz’s formula for the polylogarithm, 
however, Lerch’s result is more general. 


6m aus diesen Formeln Vortheil fiir die Classenzahlen zu gewinnen, miisste man den entspr. 


Zusammenhang fiir die 
‘3 1 Ss 
ax? + 2bxy + cy? 


erforschen.” 
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Fig. 7 From left to right: Rudolf Otto Sigismund Lipschitz (1832-1903), Mathias Lerch (1860- 
1922), and Paul Epstein (1871-1939) who committed suicide during the Nazi time in the face of 
a summons by the Gestapo. For details about Epstein’s life we refer to Siegel [108] and Burde 
et al. [8] 


In order to gain information about class numbers from these formulae one would have to 
investigate the corresponding relationship for 


1 S 
X (a + 2bxy + #) 


This series appeared already in Dirichlet’s paper [19]. A thorough analysis of 
such Dirichlet series associated with quadratic forms was realized by Paul Epstein 
in his 1903 paper [22]. 

In Hurwitz’s Diary no. 6 with date 17 August 1889 we can find the following 
lines which seem to be a proposal for a letter he had probably written to Lipschitz, 
however, this letter seems to be lost”’: 


Dear privy councillor! Mr Minkowski had informed me some time ago that my treatise on 
Dirichlet series is intimately related with your article published in the 54th issue of Crelle’s 
journal. Unfortunately, I did not know this article while I was writing my treatise, hence I 
omitted to mention it. Recently, you were so kind to send me your article “Untersuchung 
der Eigenschaften einer Gattung von unendlichen Reihen’ touching this topic. Therefore I 


27 At least, Hurwitz is not mentioned in Scharlau’s list [98] of letters Lipschitz had received from 
his contemporaries. There is also no correspondence between Lipschitz and Hurwitz in the latter 
one’s estate, neither in Zurich nor in the archive of the Staats- und Universitatsbiliothek Gottingen 
where most of his letters are stored. 
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Fig. 8 The proposal for a letter to Lipschitz with date 17 August 1889, found in Hurwitz’s 
Mathematical Diary no. 6, p. 115/116. Figures 15 and 16 show these pages in large 


would like to thank you and mention a few further results which I obtained in continuation 
of my work about D.’s series long ago. While being occupied with further questions I was 
deducted from a rigorous examination of these investigations, and therefore I am not quite 
sure whether all conditions I suppose on the introduced objects are necessary. The theorems 
I have found consider the function 


Nie 


e2ri(nivi Hn202+...F Mp dp) 


A 
m s) =r) 2S ae a 


N1N2,...Mp 


Here u),...Upy,V1,...Upy denote 2p real quantities; the summation is with respect to all 

ositive and negative integer values n,, nz, ...n,, where in case of integers wu, U,...u, the 
15/2 ‘Dp 1,42 iD 

combination nj = —uj1,n2 = —uz,...Ny = —Up is excluded. Moreover, p(x1,X2,... Xp) 


stands for a positive quadratic form of (without loss of generality) determinant equal to 1. 
So far the sum converges for all values of s having real part greater than - The function 
f(s) can be continued over the whole complex plane and it comes out that this function is 
univalent and is infinite of the first kind only for s = 0 and s = p/2. If we consider the 
function 


Ne 


eri +n2U2+...4+-npup) 


A = 
> [P(m + v1, M2 + 02,...Npy + Up)IS 


Pp 
2) F(s) =(s) (=) 


where ¢ is the adjoint form to g, then the function F(s) has the same analytic character as 


f(s) and 
FG 2 s) = en 2rituniturn+..tunin) A= (5) 


f(s) = eri La A> Ae —s) 
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p(p—2) 
Fil) = F@) (==) 


F\(s) = f(A?! 


Nie 


et erin vi +... Mprp) 
> AP—Dslo(ny + m,.. -Mp + Up) I° 


Wishing you pleasant h. I finish with my best regards. Yours sincerely, A. Hurwitz”® 


°8Sehr verehrter Herr Geheimrat! Herr Minkowski hat mich schon vor einiger Zeit darauf 
aufmerksam gemacht, daB meine Abhandlung iiber die Dirichlet’schen Reihen in enger Beziehung 
zu Ihrem im 54 ten Bande des Crelle’schen Journals publizierten Aufsatz steht. Leider war mir 
dieser Aufsatz zur Zeit als ich jene Abhandlung schrieb nicht bekannt, so daf} ich es unterliei auf 
denselben Bezug zu nehmen. Neuerdings waren Sie so freundlich mir Ihre denselben Gegenstand 
betreffende Abhandlung ‘Untersuchung der Eigenschaften einer Gattung von unendlichen Reihen’ 
zuzusenden. Ich méchte Ihnen hierfiir meinen herzlichen Dank aussprechen und mir zugleich 
erlauben, Ihnen einige weitere Resultate mitzutheilen, welche ich im Anschlu8 an meine Arbeit 
tiber die D. Reihen vor langerer Zeit erhalten habe. Durch die Beschaftigung mit anderen Fragen 
bin ich immer wieder von der sorgfaltigen Durcharbeitung dieser Untersuchungen abgezogen 
worden, und ich wei deshalb nicht, ob die Bedingungen welchen ich die eingef. Groen 
unterwerfe sdémtlich nothwendig sind. Die Satze welche ich gefunden habe beziehen sich auf die 
Function 


s P 
3 e2ti(ni vi Hn202F...FNpUp) 


A 
GQ HO= 1) (=) > [y(n + uy, 2 + u2,...Np + Upy)]* 


NY N2,..Np 


Hier bedeuten u1,...Up, V1,... Up 2p reelle GréBen; die Summation bezieht sich auf alle positiven 
und negativen ganzzahligen Werthe von nj, 12, .../»), wobei jedoch falls uw, uz,...u, ganzzahlige 
Werthe besitzen die Combination ny = —u,,n2 = —U2,...Ny) = —Up auszuschliefBen ist. 
Ferner bedeutet g(x), x2, .. .x») eine positive quadratische Form, deren Determinante (unbeschadet 
der Allgemeinheit) gleich 1 angenommen wird. Die Summe convergirt bisher fiir alle Werthe 
von s deren reeller Best. gréBer als ist. Die Function f(s) la8t sich iiber die ganze complexe 
Zahlenebene fortsetzen und es zeigt sich dabei, dafi diese Function in der ganzen complexen 
Zahlenebene eindeutig ist und nur fiir s = 0 und s = p/2 von der ersten Art unendlich ist. 


Bildet man nun die Function 
—2ri(nyuy+ngU2+...+-Npup) 


Ao 
@) =) (= ) 3 [o(m = Vj,N2 + V2,...Ny + vp)]S 


wo @¢ die zu g adjungirte Form bezeichnet, so hat die Function F(s) denselben analytischen 
Charakter wie f(s) und es ist 


Z o—1 
FG —s)= eo 2itvituavet...bunvn) AZ (5) 


f(s) = =, ew 2ni uw pa TFB = 5) 


Ape?) 5 et 2ri(ni v1 +... +N rp) 
A =r0(-) Dae 
a AY [p(m + Uy,..-Np + Up)] 


Fi(s) = f(Ae-- 


Indem ich Ihnen angenehme F. wiinsche, verbleibe ich mit freundlichen GriiBen. Ihr ganz ergebener 
A. Hurwitz” Notice that we have not copied scratched formulae or fragments of sentences. The 
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The analytic properties of this zeta-function were proven in Epstein’s paper 
published in 1903 and submitted in January 1912, so 13 years after Hurwitz’s letter 
to Lipschitz. In Epstein’s paper [22] the notation 


as mi >, mhj 
Aa Co e Dy mihy 
Oy Sx 
hy... hp pee TE I)? 
is used, where m = (mj,...,mp) is a tuple of integers, g = (g1,...,g,) and 
h = (hy,..., hp) are tuples of real numbers, and g((m + g)) is the quadratic form 
defined by 
Pp 
p((x)) = > ChAXKXE 
kl=1 
with cye = cex and x = (x1,...,Xp). Following Riemann’s contour integration 


method, Epstein derived the following functional equation 


2M Lk Silk — 3 PP (3)z|¢ see a (5) 


2) | hy... Mp 


= ater (P*)z| jean 


2 81... 8p 


In the easiest case, when all hy and gx, are zero and takes the representation 
y((m)) = m'Qm with some symmetric p x p-matrix Q into account, one deduces 
via substituting s +> 2s the functional equation in the form 


m*T(s)6(s;Q) = (det 'x Er (E—s)¢(F-s.0) 


for the Epstein zeta-function 


C(s,Q):= > (m'Qm)* 


OAmEZP 


as it usually appears in the recent literature. We notice that this is essentially in 
coincidence with Hurwitz’s remark. Epstein mentioned Hurwitz’s paper [46] on the 
Hurwitz zeta-function as well as Lipschitz [69] and Lerch [65] (but, of course, not 
Hurwitz’s diary entry nor his lost letter to Lispchitz). Furthermore, Epstein wrote 
Consequently, for the so far most general zeta-function, investigated by Mr Lipschitz, there 


exists an integral representation in terms of an infinite theta series of arbitrary characteristic. 
Following this thought we define a general zeta-function of pth order by a p-folded infinite 


abbreviation ‘F” might stand for ‘Ferien’ meaning ‘holidays’, so we have used the letter ‘h.’ in our 
translation. 


332 N. Oswald and J. Steuding 


series which generalizes Riemann’s zeta-function in just the same way as the general theta 
series of pth order generalizes the elliptic theta series. This extension of the zeta-function 
is natural since the above mentioned theorem of Riemann about the function ¢(s) can be 
transferred to the general zeta-function in a surprisingly easy way.” 


We do not want to go into the details of the appearing notions; and we also do 
not explain here the different theta-functions but remark that Epstein’s approach 
is obviously via Riemann’s second proof of the functional equation (and Vallée 
Poussin’s generalization to Dirichlet L-functions).*° 

A last comment about Hurwitz’s letter to Lipschitz. In 1887, after studying with 
Hurwitz in K6nigsberg, his former pupil and friend Hermann Minkowski became 
professor at the University of Bonn where Lipschitz was professor (from 1864 until 
the end of his life in 1903). We may speculate that this contact had been a reason for 
the correspondence between Hurwitz and Lipschitz.*! Bianchi’s hint from his 1881 
letter is not mentioned. 

We conclude with a discussion about the name attributed to Hurwitz’s general- 
ization. Reading Landau’s historical survey [58] from 1906, we find the Hurwitz 
zeta-function with notation ¢(s,w); a quotation of Hurwitz is missing, however, 
Mellin [71] is mentioned. In 1929, Walter Schnee [103] gave a new proof of 
the functional equation for ¢(s,a@) and Dirichlet series with periodic coefficients, 
however, Hurwitz is not mentioned. The same in Helmut Hasse’s paper [41], 
where ¢(s, a) is called “allgemeine ¢-Reihe.” In 1950/51 this was changed by two 
articles in the same volume of the Proceedings of the American Mathematical 
Society, namely Fine [26] and Apostol [2]. The first paper, the one by Fine, gives 
a proof of the functional equation for the Hurwitz zeta-function by means of 
Riemann’s second approach using Poisson summation and properties of Jacobi’s 


?°Dem entsprechend existirt fiir die von Herrn Lipschitz untersuchte Function—die allgeme- 
inste bisher betrachtete ‘Zeta’-function—eine Integraldarstellung mit Hiilfe einer unendlichen 
Thetareihe mit beliebiger Charakteristik. Diesem Gedankengange folgend definiren wir als 
allgemeine Zetafunction p'*’ Ordnung eine p-fach unendliche Reihe, welche in gleicher Weise 
eine Verallgemeinerung der Riemann’schen ¢-function darstellt, wie die allgemeine Thetareihe 
p’" Ordnung gegeniiber der elliptischen Thetareihe. Dass diese Erweiterung der Zetafunctionen 
naturgemiass ist, zeigt sich vor allem darin, dass sich der oben angefiihrte Riemann’sche Satz tiber 
die Function ¢(s) in tiberraschend einfacher Weise auf die allgemeinste Zetafunction tibertragen 
lasst.” [22, p. 616]. 


3°Tn a subsequent paper the authors intend to investigate the Hurwitz estate further in order to find 
details about his approach to zeta-functions associated with quadratic forms. 


31Tn 1897, Minkowski married Auguste Adler from Strasbourg, the place where Epstein obtained 
his doctorate in 1895 (advised by Elwin Christoffel) and remained as docent until 1919 when 
he received the call of the recently founded university in his native town of Frankfurt/Main. 
As a matter of fact, Minkowski was frequently visiting his brother and famous physician Oskar 
at Strasbourg starting from 1889 as we know from Hermann Minkowski’s letters to Hilbert 
[75]; during some of these visits Minkowski got in touch with the mathematicians at Strasbourg 
University and Christoffel in particular (see [75, p. 34 and 36]). A word about Frankfurt University, 
too. This institution was founded in 1914, mainly by donations of the Jewish community of 
Frankfurt, but the Jew Epstein was driven to commit suicide under the Nazi regime. Actually, 
all members of the Frankfurt Mathematical Seminar except Siegel lost their positions. 
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theta-function. The second paper, the one by Apostol, gives another proof relying 
on Lerch’s transformation formula. Both authors explicitly call €(s, «) the Hurwitz 
zeta-function. We do not consider the many further research papers on the Hurwitz 
zeta-function here. The name Epstein zeta-function for the generalization associated 
with quadratic forms, however, has been used since Edward Charles Titchmarsh’s 
article [110] from 1934. 

The Hurwitz zeta-function ¢(s,a) with an irrational parameter a is not treated 
in Hurwitz’s original paper. For example, in the monographs of Lang [61, 62] and 
GarunkStis and Laurincikas [29] the reader may find (essentially the same) proof for 
the Hurwitz zeta-function with a not necessarily rational parameter and in the latter 
one the proof of the functional equation for the more general Lerch zeta-function. 

Obviously, some of the mathematicians mentioned above were not very well 
informed about what others did. The standards of citation were rather different in 
the nineteenth century, and so the distribution of research results and of journals as 
well. Another aspect is that it was not quite clear in the process of generalization 
from the Riemann zeta-function and Dirichlet L-series which one of all the various 
possible generalized zeta-functions would be the best (if one really would like to 
coin the notion “best” in this context). The struggle for the best generalization of the 
Riemann zeta-function reminds the authors on the numerous approaches to multiple 
(multivariate) zeta-functions and zeta-values in the literature (e.g., the Euler-Zagier 
multiple zeta-functions) investigated intensively for about 25 years. 


2 K6nigsberg and Zurich: Entire Functions 


In 1884 Adolf Hurwitz became extraordinary professor at the University of KGnigs- 
berg (now Kaliningrad in Russia) on invitation of Ferdinand Lindemann. Although 
friends and relatives as well as most of his former mathematical friends and 
colleagues were far distant Hurwitz spent a very good time in K6nigsberg. He 
produced many beautiful theorems and he was rather successful in teaching: 
among the few Konigsberg students Hurwitz had with David Hilbert and Hermann 
Minkowski two brilliant pupils. During frequent walks Hurwitz introduced the two 
students to various mathematical disciplines and guided their first steps in research. 
The three of them became lifelong friends. Moreover, Adolf Hurwitz got to know 
his later wife, Ida Samuel, the daughter of the professor for Pathology at K6nigsberg 
University; they married in summer 1892. In the same year Frobenius left Zurich for 
Berlin and Hurwitz was offered the vacant ordinary professorship. The young family 
moved to Switzerland. 
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Fig. 9 Ida Samuel-Hurwitz (1864-1951) and Adolf Hurwitz; the photograph has been taken in 
the KGnigsberg period, probably around the time of their marriage in 1892 


2.1 Prehistory: The Proof of the Prime Number Theorem 


Riemann’s celebrated paper [93] is at least nowadays considered most important 
with respect to its not rigorously proved statements and conjectures. This memoir 
has indeed been a signpost for developing complex analytic methods in order to 
reveal distribution properties of the prime numbers. Taking Riemann’s personality 
into account one may understand his contribution also as an outline of a far 
reaching analytic method in number theory, leaving the details to the posterity. The 
announced product representation was obtained by Jacques Hadamard [35] in 1893, 
the explicit formula linking the zeros of ¢(s) with the prime numbers was proved 
by von Mangoldt [111] in 1895, the so-called Riemann-von Mangoldt formula 
for the number of zeros inside certain rectangles was as well obtained by von 
Mangoldt in the same paper; only the famous Riemann hypothesis claiming that all 
nontrivial zeros lie on the critical line has remained unsolved until today.** Building 


3? Actually, Riemann wrote that it is “very likely” that all these zeros lie on a line; he did neither 
announce this comparable to his other claims nor did he formulate it as a conjecture. Nowadays 
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Fig. 10 On the left Jacques Hadamard (1865-1963); together with Emile Zola and others he was 
supporting Alfred Dreyfus in the Dreyfus affair in the 1890s. In the middle Charles Jean Gustave 
Nicolas Baron de la Vallée Poussin (1866-1962). It is legend that everyone who provides a new 
proof of the prime number theorem is immortal; Hadamard and Vallée Poussin died at age almost 
100 years. On the right-hand side Hans von Mangoldt (1854-1925) who served as rector for the 
Konigliche Technische Hochschule Aachen (now RWTH Aachen) for 1899-1901. His contributions 
became major tools in zeta-function theory 


on Riemann’s ideas, Hadamard [37] and Charles de la Vallée Poussin [16] solved in 
1896 independently a conjecture of Gauss on the number of primes below a given 
magnitude; their reasoning culminated in the celebrated prime number theorem giv- 
ing the asymptotics for the counting function 2 (x) of prime numbers p < x, namely 


] 
lim z(x) Bee 1, resp. I(x) ~ = 
x00 x log x 


as x — oo. More precise formulae provide explicit error terms depending on the 
zero-free region for the zeta-function. For details on the history of the development 
of the prime number theory we refer to Narkiewicz’s monumental [77], Laugwitz’s 
Riemann biography [63], and Schwarz’s survey [104]. 


2.2 Hadamard’s Theory of Entire Functions of Finite Order 


A function f(s) is called entire if it is analytic (holomorphic) throughout the whole 
complex plane; if its absolute value is not growing too quickly as |s| tends to infinity, 
it is said to be of finite order (and “of finite genre” in older literature). Since the 
Riemann zeta-function ¢(s) is analytic except for a simple pole at s = 1, it is natural 
to consider (s — 1)¢(s) or similar functions with respect to the general theory of 


we can only speculate whether Riemann knew about the arithmetical consequences of such a zero 
distribution. 
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entire functions. This line of inquiry had been established among others by Jacques 
Hadamard in his dissertation [34] from 1892 advised by Emile Picard. Hadamard’s 
motivation, however, was not the theory itself but its application to the unsolved 
problems around the Riemann zeta-function. In fact, his theory of entire functions of 
finite order constitutes a major ingredient in his and de la Vallée Poussin’s celebrated 
proof of the prime number theorem and later refinements. 

In the Jahrbuch der Mathematik we can read in a review by Adolf Hurwitz 
concerning Hadamard’s article [35]: 


The article under review is the accomplishment of a work which had been awarded the 
great mathematical prize of the Parisian Academy of Sciences. Taking into account that the 
author had only a limited amount of time, it is attributed that the formulation of the article 
causes trouble to its comprehension. For example, the proof of the main theorem from the 
first part should have been arranged differently in order to be rigorous. This is just about the 
outward appearance of the work; as regards content it may be considered as one of the most 
important function-theoretical pieces of the last years.** 


Indeed, Hadamard’s article under consideration had been awarded the Grand 
Prix des Sciences Mathématique of the French Academy of Sciences in 1892 
for a treatise on the Riemann zeta-function (reporting about the results from his 
doctorate). Hurwitz wrote several hundreds of reviews of articles in German, 
French, Italian, and English for the Jahrbuch tiber die Fortschritte der Mathematik** 
and these reviews provide a mathematical fingerprint of his research activities 
and interests. Hurwitz followed the many works dealing with entire functions 
in particular and his diaries contain numerous entries about this topic. In his 
Mathematical Diary No. 15, we find for 22 February 1897 an entry analyzing the 
Hadamard’s papers [36, 37] on the distribution of zeros of €(s) and its arithmetical 
consequences. It is interesting to notice that Hadamard cited Hurwitz’s paper [46] 
(besides Cahen [9]) and made use of his identity. 


2.3. Polya and Hurwitz’s Estate 


George (Gyérgy) Polya was born in Budapest 1887; he finished his studies 
in 1912 with a doctorate supervised by Lipot Fejér in Budapest, investigating 


33"Die vorliegende Abhandlung ist die weitere Ausfiihrung einer Arbeit, die von der Akademie 
der Wissenschaften zu Paris mit dem grossen mathematischen Preise gekront worden ist. Dem 
Umstande, dass der Verfasser nur eine beschrénkte Zeit zur Verfiigung hatte, ist es wohl 
zuzuschreiben, dass die Abfassung der Abhandlung dem Verstandnisse manche Schwierigkeiten 
bereitet. So muss beispielsweise der Beweis des Hauptsatzes im ersten Teile der Arbeit anders 
angeordnet werden, um bindende Kraft zu erhalten. Diese Ausstellung betrifft indessen nur die 
aussere Form der Arbeit; inhaltlich darf dieselbe wohl als eine der bedeutendsten functionentheo- 
retischen Arbeiten der letzten Jahre bezeichnet werden.” 

34Which was founded as early as 1868 by Carl Orthmann and Felix Miiller and served as pre- 
runner of Mathematical Reviews and Zentralblatt until 1942; now its many reviews are included 
into the Zentralblatt data bank. 
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Fig. 11 In Hurwitz’s Mathematical Diary no. 15, 22 February 1897, p. 38/39, one can find detailed 
notes about Hadamard’s reasoning for proving the non-vanishing of (1 + it) for real t from his 
1896 paper [37] which is equivalent to the prime number theorem as was first proved by Landau 
and used in his proof of the prime ideal theorem [58] in 1903. Interestingly, also Diary no. 11 
contains a few notes on the pages 52-66 about the same article [37] similar to the one in Diary 
no. 15 from above. They appear without date; in fact Diary no. 11 was started in 3 October 1894 
but never finished 


geometric probability. After a postdoc in Gottingen he started at the Eidgenéssische 
Hochschule Zurich in 1914 on promotion by Hurwitz. He became extraordinary 
professor in 1920 and ordinary professor only in 1928 (despite his numerous 
important results).*° In 1940 Pélya moved to the USA where he had professorships 
at several universities, since 1942 in Stanford. Pélya died in Palo Alto in 1985. He 
is considered as one of the outstanding mathematicians of the twentieth century. As 
his mentor and colleague Hurwitz Polya had a broad mathematical knowledge and 
interest. 


He also credited Hurwitz’s extensive mathematical diaries for a number of problems that 
later appeared in the Aufgaben und Lehrsdtze. These diaries may have influenced Polya in 
other ways: he kept a mathematical log himself throughout much of his career, recording 
mathematical conversations and keeping track of his own mathematical ideas. 


wrote Alexanderson and Lange [1]. The mentioned book Aufgaben und Lehrsdtze 
aus der Analysis was a joint work with Gabor Szegé and the innovative style of this 
collection of exercises (ordered with respect to methods to prove these results rather 


35The Lerch pupil Michel Plancherel was chosen to inherit Hurwitz’s chair, not Pélya although he 
was considered as outstanding mathematician by none less than Hilbert and Hadamard; see Frei 
and Stammbach [27, p. 51] for details. 
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Lit 


Fig. 12 On the left: George Pélya (1887-1985) who also kept a mathematical diary. In the middle: 
Johan Ludwig Valdemar Jensen (1859-1925) who did not keep a mathematical diary. On the right 
another photograph of Adolf Hurwitz about the time when he got to know Pélya 


than the results itself) is still a pearl in the mathematical literature. Besides many 
interesting tasks it includes, for example, the proof of the existence of infinitely 
many prime numbers by use of the coprimality of the Fermat numbers 2" + 1 which 
is probably based on lectures of Hurwitz’ which were later published as [50]*° and 
the letter exchange between Goldbach and Euler; for details we refer to Haas [33]. 

Starting in 1921, two years after Adolf Hurwitz’s death in Zurich 18 November 
1919, Polya became literary executor of Hurwitz’s estate. In 1933 Hurwitz’s 
collected papers were finally published, edited by Polya. The mathematical diaries 
of Adolf Hurwitz definitely had been another source of inspiration for Pélya.*” Here 
we shall focus on zeta-function theory. 


Of those topics which are treated in the diaries for several years again and again we 
shall mention here Riemann’s zeta-function as an example; in the published works of 
Hurwitz this object is not mentioned at all. Very early Hurwitz had tried a path towards 
the Riemann hypothesis in vain which some years later also Jensen found: (...) The aim 
was unreachable.** 


These words are due to George Polya introducing and commenting the unpub- 
lished notes of Adolf Hurwitz [90, p. 6]. The first mathematical diary started in 25 
April 1882, too late for Hurwitz’s work on the Hurwitz zeta-function [46]. 


36 And may have been also an inspiration for the book [91] of Pélya & Szegé. 

37Since in Hurwitz’s last diary one can find some notes from 1918 about Arthur Cayley’s studies 
of counting trees, namely alkane C,,H,4 2 with certain restrictions, one could speculate whether 
this might have been an inspiration for Pélyas celebrated enumeration theory. 

38Von denjenigen Gegenstiinden, auf welche die Tagebiicher lange Jahre hindurch immer wieder 
und wieder zuriickkommen, sei hier als Beispiel nur die Riemann’sche ¢-Funktion hervorgehoben; 
in den ver6ffentlichten Arbeiten von Hurwitz wird dieser Gegenstand wohl nicht einmal erwdhnt. 
Schon friih hat Hurwitz den Weg zur Riemann’schen Vermutung versucht, den einige Jahre spiater 
auch Jensen, ebenfalls ergebnislos versucht hat: (...) Das Ziel war unerreichbar.” 


Zeta-Function Theory in the Mathematical Works of Adolf Hurwitz 339 


Hurwitz was the type of mathematician who always achieved something when seriously 
using his power, if not the envisaged objective, then at least something interesting besides. 
Two side issues of his efforts about the ¢-function reached the mathematical community by 
oral communication... .*° 


This quotation is again from Pélya’s introductory words to Hurwitz’s estate [90], 
p.7. Indeed Polya himself had continued some of Hurwitz’s ideas. 

In 1914, Godfrey Harold Hardy [40] proved that infinitely many zeros of ¢(s) 
lie on the critical line. His reasoning was based on the real-valued function defined 
essentially by the left-hand side of the functional equation, i.e., 


EQ := g9(s— Dx TEE ap 4 


and its integral representation 


[o.@) 
a(t) = 5 -(P+ pf xa cos(5tlogx) > exp(—mn?x)dx. 


n>1 


Hardy’s reasoning relies on the integral 


[o-e) rr 
& (t) ——— cosh(at)dt 
/ (Oz qoosmlan 


and taking the limit # — 7— under the assumption that &(t) is of constant sign 
for all sufficiently large t. By the upcoming contradictions it follows that &(¢) must 
have infinitely many odd order zeros, and since all factors of &(t) do not vanish 
except € (4 + it) possibly, this proves the existence of infinitely many zeta zeros on 
the critical line. In Hurwitz’s Mathematical Diary no. 26 we can find an outline of 
Hardy’s proof. Pélya [87] found a variant of Hardy’s proof by the associated Fourier 


integral 


P(u) := Lf & (t) cos(ut)dt 


and its Taylor expansion. The same line of inquiry has been treated by Polya in his 
papers [85, 86, 88]. Probably, the easiest proof of Hardy’s theorem is due to Landau 
[60, p. 78], which relies on the inequality 


2T 2T 
/ Z(t)dt < / |Z(t)|dt, 
T T 


39*Nun war Hurwitz von dem Mathematikertypus, der bei ernstlicher Einsetzung seiner Krifte 
immer etwas erreicht, wenn nicht das urspriinglich ins Auge gefasste Ziel, so doch etwas 
Interessantes auf den Seitenwegen. Zwei Nebenresultate seiner Bemiihungen um die ¢-Funktion 
sind durch miindliche Mitteilungen in die mathematische Oeffentlichkeit gelangt: ...”. 
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Fig. 13. Hurwitz’s notes of Hardy’s proof from his Diary no. 26, p. 181. Hurwitz noticed that 
Hardy’s article had been published in the same issue as one of his papers [49] correcting an 
inaccuracy in Ludovic Zoretti’s book Legons sur le prolongement analytique from 1911. The 
sentence on top claims that Leibniz did not believe that the equation z+ + 1 = 0 could be solved 


using complex numbers 


valid for sufficiently large 7, where 


Z(t) := exp(iO(1))E(4 + it) 


with 


exp(iO(t)) := 


_# rqt+4 
IrG+l 


is real-valued. Hardy’s approach allows quantitative results, however, the so far 
best quantitative bound for the number of zeros on the critical line was obtained 
by Levinson’s method and its technical refinements.” 

Pélya’s paper [84] from 1918 deals with the zeros of entire functions represented 
by certain Fourier integrals. It even contains a theorem due to Hurwitz and his proof 


40See Iwaniec [52] for a recent presentation. 
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(see [84, p. 368]), namely: given an even function f(t) defined for t € (—1, 1) having 
alternating Fourier coefficients, i.e., 


fa ~ $ap + a, cos(zt) + ay cos(27t) + a3cos(3mt) +... 


with ay > 0,a, < 0,a2 > 0,a3 < 0,..., then the zeros of the entire function 


1 
va= f f(t) cos(zt)dt (2.1) 


are all real and simple, and such that in each of the intervals... ,(—22,—7), 
(—x,0), (0, 7), (1, 27),... contains exactly one zero. This result and its rather 
lengthy proof is related to Pélya’s reasoning; moreover, a paper of Jensen is 
mentioned. And the name Jensen comes up in another context as well. For his 
article [87] Pdélya has studied the mathematical papers left by Johan Ludwig William 
Valdemar Jensen.*' The autodidact Jensen never had an academic appointment, 
however, working for the International Bell Telephone Company at Copenhagen, 
he found time for severe mathematical research. In zeta-function theory the so- 
called Jensen formula plays an important role in estimating the number of zeros 
of an entire function in a given disk.” In 1911, during the Second Congress of 
Scandinavian Mathematicians at Copenhagen Jensen [53] announced five articles 
about new algebraic and function-theoretical methods with applications to the 
Riemann zeta-function (JV), however, none of these announcements was published 
until Jensen’s death in 1925. Probably inspired by his studies of Hurwitz’s estate, 
Pélya investigated the Jensen estate (JN) but could not find what he was aiming at. 


This all is to find in several small notebooks and a great number of loose pages such that 
in most cases it is impossible to fix in which sequence and which time the notes have been 
written down. As follows from this description the state of the estate does not allow to set 
its content with certainty.** 


It seems that Jensen was in some sense a messy mathematician as Hurwitz was a 
tidy character. This chaos did not stop Pélya’s investigations; he even wrote a survey 
on integral representations and their applications with respect to the Riemann zeta- 
function. In this context Polya gave credit to Hurwitz: 


I is contained in JV (p. 188), Il and III are often quoted in JN. For II and IV see P3. I and 
III were (as well as the consequences mentioned in Nr. 6 following Theorem III) known 


‘lt is interesting to notice that around the same time Carl Ludwig Siegel was investigating 
Riemann’s estate in Gottingen; his study [106] has changed the impression many contemporaries 
had about Riemann’s approach to number theory. 

* According to Narkiewicz [77], p. 258, Jensen’s formula was essentially known by Carl Gustav 
Jacobi already in 1827. 

43“Dies alles in einigen kleinen Heften und an einer grossen Anzahl von losen Bliattern, so dass 
in den allermeisten Fallen unméglich festzustellen ist, in welcher Reihenfolge und zu welchem 
Datum die Aufzeichnungen entstanden sind. Wie aus dieser Schilderung hervorgeht, erlaubt der 
Zustand des Nachlasses nicht, dessen Inhalt mit Sicherheit festzustellen.” 
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Fig. 14 Hurwitz’s Diary no. 17, p. 115. Formula (2.1) appears in disguised form in the lower third. 
Another entry of the diary, a few pages later, bears the date 3 December 1899 


by several friendly mathematicians; both appear in the mathematical diaries of A. Hurwitz 
(preserved by the library of the Eidg. Technischen Hochschule in Zurich) with date 1899.*4 


Indeed, in the Mathematical Diary no. 17, pp. 112-115, we can find the 
corresponding formula (see Fig. 14). 

Now we are in the position to explain what Pélya meant in writing “Very early 
Hurwitz had tried a path towards the Riemann hypothesis in vain which some years 
later also Jensen found” (as mentioned above). In 1889, Hurwitz [47] extended a 
classical result due to Siméon Poisson [81, p. 178], on the zeros of Bessel functions 
J, (whose definition we omit here), namely that all the zeros are real whenn > |. He 
extended and quantified Poisson’s result about the zero-distribution using Rouché’s 
theorem* and more advanced techniques; in particular he obtained his famous 


447 ist in JV (S. 188), II und III sind in JN 6fters hervorgehoben. Fiir II und IV vel. P3. II und 
III waren (nebst der in Nr. 6 im Anschluss an Satz II hervorgehobenen Konsequenzen) mehreren 
mit mir befreundeten Mathematikern bekannt; beide kommen in den Tagebiichern von A. Hurwitz 
(aufbewahrt in der Bibliothek der Eidg. Technischen Hochschule in Ziirich) unter dem Datum 1899 
vor,” [87, p. 11]. Here P3 is an abbreviation for [85]. 

45Which probably was unknown to him and better should be named after Augustin Louis Cauchy 
as already Eugéne Rouché had remarked; see Bottazini and Gray [7] for details. 
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theorem on uniformly convergent sequences of meromorphic functions having limit 
either identically vanishing or, more interestingly, every zero of the limit function 
is limit of the zeros of the functions in the sequence. This result is now a standard 
tool in the complex analytic tool box. Using this observation, Hurwitz was led to a 
separation of the real zeros as in the case of (2.1) above. And here is the link to his 
approach to the Riemann hypothesis. Hurwitz wrote: 


The proof of this theorem is, following Poisson’s process, usually based on an integral 
representation. . . *° 


We observe the similarity with the situation considered by Hardy and the entry 
in Diary no. 17 (Fig. 14) from above. In order to prove that all nontrivial zeros of 
the zeta-function are on the critical line one could aim at finding a suitable integral 
representation; this could be realized by giving criteria that an entire function (e.g., 
a Fourier transform of a kernel) has all its zeros on the real axis. This idea appears 
in the literature several times; however, it seems that Hurwitz was among the first 
to have considered this approach “in vain (...) the aim was unreachable” as Pélya 
remarked. 

In a subsequent paper [88], following an idea of Landau, Polya obtained the 
related remarkable result that all zeros of the function 


[o,@) 
E* (t) := 4n? / cosh(3u) exp(—2z cosh(2u)) cos(ut)dt 


are real; although this function is an approximation of 4(f), unfortunately this has 
no impact on the location of the zeros of the Riemann zeta-function. Nevertheless 
it reminds us of another related approach to the Riemann hypothesis. Based on a 
theorem due to Edmond Nicolas Laguerre on entire functions with real zeros, Polya 
[82] studied sequences of polynomials with positive real roots and proved that a 
uniform limit is either vanishing identically or an entire function of order zero or 
one (see (2.2) below). Interestingly, Pélya’s paper includes a certain criticism of 
Laguerre’s proof: 


The correctness of this proof can be doubted; I am of the opinion that the proof can be 
realized rigorously without any substantial changes. The here given proof is completely 
independent of the correctness of Laguerre’s reasoning.*” 


It should be mentioned that [82] had been written in 1913, Pélya’s last year in 
Gottingen before he moved to Zurich. Around the same time Jakob Grommer wrote 
his doctorate [32] supervised by Hilbert on this topic. He found an improvement of 
one of Hurwitz’s unpublished theorems on entire functions with real zeros only; the 


46“Der Beweis dieses Satzes pflegt man nach Poisson’s Vorgang auf eine Integralformel zu 
griinden. ..”; [47, p. 246]. 

47Die Richtigkeit dieses Beweises kann angezweifelt werden; ich bin der Ansicht, dass der Beweis 
ohne wesentliche Aenderungen des Gedankenganges liickenlos zu fiihren ist. Der hier gegebene 


Beweis ist von der Richtigkeit des Laguerreschen vollig unabhangig, und beruht auf einer ganz 
anderen Grundlage.’, [82, p. 280]. 
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original result and questions about its possible generalization were communicated 
by Otto Toeplitz and it was Hilbert who supported Grommer “who apart from his 
physical ailment faced triple jeopardy as a foreigner, orthodox Jew, and as someone 
without the equivalent of the Abitur, since he graduaded from a Talmudic school 
rather than a traditional Gymnasium” wrote Rowe and Schulmann [95]. Grommer’s 
thesis forms the second issue Polya had mentioned in his comment about the 
Hurwitz estate. 

It seems that these kind of questions were among the first that Hurwitz and Pélya 
discussed after the latter’s arrival at Zurich in 1913. In Diary no. 26 one can find 
an entry on “Pélya’s theorem” under date 26 February 1914 which is related to his 
publication [83]; actually, the same diary contains a note with respect to Laguerre’s 
theorem as well as an application of Charles Biehler’s theorem [5] on real zeros of 
entire functions with comment 


A pretty application of Biehler’s theorem came to me last night.** 


In the recent literature the so-called Laguerre—Polya class is defined as the set of 
entire functions E(s) being locally limits of a series of polynomials having only real 
roots. Such functions obey (like polynomials) the Hadamard product representation 


E(s) = s“ exp(a + bs + cs”) I] (1 - ~) exp ( : ) F (2.2) 


Sn Sn 


where a,b,c are constant, b and c real, the latter one zero or negative, d is the 
multiplicity of a (hypothetical) zero at s = O, and the s, denote the further zeros 
of E. There is a lot of literature on this class of entire functions and their zero- 
distribution; the impact on the Riemann zeta-function and allied functions has been 
investigated for decades, e.g., Louis de Branges [15] should be mentioned here. 
However, since this is not really related to Adolf Hurwitz and his mathematics, we 
refer here only to the survey article [12] by Csordas, Norfolk and Varga. In particular 
Pélya’s paper [82] has been extended and precised by Dennis Hejhal [45]. 

Another idea in the context of the Riemann hypothesis related to Pélya is the 
so-called Hilbert-Pélya conjecture that the ordinates of the nontrivial zeros of the 
Riemann zeta-function correspond to eigenvalues of an unbounded self-adjoint 
operator. This idea as well does not go back to conversations with Hurwitz but arose 
from Pélya’s time in Gottingen and a question by Landau.” 


3 Concluding Remarks 


Introducing analytic methods to number theory was a change of paradigm. The 
ideas of Euler, Dirichlet, and Riemann, however, were ahead the methods at their 
time. Open question about prime number distribution became a driving force for 


48“Fine hiibsche Anwendung des Biehler’schen Satzes fiel mir vergangene Nacht ein.” 
4°See the correspondence [89] between Andrew Odlyzko and Pélya on this topic. 
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the development of certain aspects of complex analysis. The distribution of zeros of 
zeta- and L-functions is still not understood very well. This might have two reasons: 
it is neither known what impact a Riemann type-functional equation on the zeros 
distribution actually has nor is it clear why arithmetically relevant Dirichlet series 
with an Euler product seem to have all its complex zeros on one single line.°” The 
functional equation alone cannot be the driving force for the complex zeros to lie on 
the critical line. Harold Davenport and Hans Heilbronn [14] showed that the Hurwitz 
zeta-function ¢(s, @) has infinitely many zeros in the half-plane Res > 1 provided 
aX 5. 1 is rational or transcendental; moreover, they showed the explicit existence 
of Dirichlet series satisfying a Riemann-type functional equation having infinitely 
many zeros off the critical line. It is expected that the Euler product should have a 
severe impact on the zero-distribution (although there is no absolute convergence of 
the Euler product inside the critical strip). 

The second proof Riemann gave for the functional equation of the zeta-function 
may be considered more important by its relation to modular forms. This line 
of investigation has been studied by Erich Hecke starting with his proof of the 
functional equation for Dedekind zeta-functions [43] in 1916/17 and the relation 
between Dirichlet series having a functional equation of the Riemann type and 
modular forms [44] in 1936. Nowadays the role of Dirichlet series for arithmetical 
questions is quite fairly understood. We quote 


In recent years L-functions and their analytic properties have assumed a central role in 
number theory and automorphic forms. 


from the abstract of Gelbart and Miller [31] which gives an excellent reading for the 
modern zeta-function theory. We shall briefly mention here John Tate’s thesis [109] 
in which a proof of the functional equation is given on the basis of local methods. 
This method can be interpreted as an analytic variation of the local global principle 
discovered by Helmut Hasse for quadratic forms. Another line of inquiry has been 
suggested by Robert P. Langlands in the 1970s with his “Langlands Program” 
claiming that all arithmetically relevant Z-functions (automorphic or motivic) 
arise as products of L-functions associated with automorphic representations; the 
celebrated proof of the Shimura and Taniyama conjecture due to Andrew Wiles et al. 
is a prominent example. For details we refer once again to [31]. On the contrary, the 
first proof Riemann gave in his path-breaking paper and Hurwitz’s generalization 
still gives (at least in the authors’ opinion) the most simple approach to Dirichlet 
L-functions (without using Poisson’s summation formula). 

We summarize. Shortly after receiving his doctorate in 1881 Adolf Hurwitz made 
new contacts and found research problems in new mathematical topics. One of them, 
zeta-function theory, should be a field of interest for the rest of his life although 
this is apart from his paper on the Hurwitz zeta-function not visible in his list of 
publications. His mathematical diaries and a few reviews as well as the remarks in 


0 Actually, the notion of an L-function is usually used for zeta-functions having an Euler product. 
An axiomatic setting for such arithmetically relevant Dirichlet series was provided by Atle Selberg 
[105]. It is legend that the letter “L” is chosen with respect to Dirichlet’s given name “Leujene.” 
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Fig. 15 Page 115 from the mathematical Diary no. 6, [51] 


Pélya’s papers, however, give proof for this liking. Hurwitz’s ideas concerning the 
Riemann zeta-function and its relatives have inspired many contemporaries and later 
mathematicians and traces can be found until today. Last not least, Hurwitz was in 
possession of fundamental insights about an important generalization of the variety 
of zeta-functions under investigation at the turn of the century. Unfortunately, he 
(probably) only communicated his results about these zeta-function to quadratic 
forms in a letter to Lipschitz but did not find time to polish and publish his notes; 
nowadays all the credit is given to Epstein. 
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Fig. 16 Page 116 from the mathematical Diary no. 6, [51] 


We conclude with another quotation from Pélya’s text introducing the mathemat- 
ical diaries of Adolf Hurwitz: 


Perhaps one will find in the diaries some more isolated pretty results and promising 
51 
germ... 
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Selberg Sums: A New Perspective 


Samuel J. Patterson 


To the memory of Wolfgang Schwarz 


Abstract Selberg sums are the analogues over finite fields of certain integrals 
studied by Selberg in 1940s. The original versions of these sums were introduced 
by R.J. Evans in 1981, and following an elegant idea of G.W. Anderson in 1991 
they were evaluated by Anderson, Evans and P.B. van Wamelen. In 2007 the author 
noted that these sums and certain generalizations of them appear in the study of 
the distribution of Gauss sums over a rational function field over a finite field. The 
distribution of Gauss sums is closely related to the distribution of the values of the 
discriminant of polynomials of a fixed degree. Here we shall take this up further. The 
main goal here is to establish the basic properties of Selberg sums and to formulate 
the problems which arise from this point of view. 
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1 Introduction 


The usual class of character sums known as Selberg sums was introduced in 1981 
by Evans [5]. They were the analogues over finite fields of a class, or rather several 
related classes, of integrals introduced by A. Selberg in 1944, [12, p. 204 ff.]. In fact 
he had already used an integral of this type in 1941 in [12, p. 74 ff.] but was uncertain 
as to whether the integral was already known and waited before publishing details. 
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Characteristic of these integrals is that they are taken over a space of monic 
polynomials of a fixed degree and that one factor in the integrand is a power of 
the discriminant. Selberg regarded the integrals he studied as an extension of Euler’s 
beta-function and, indeed, his evaluation gives the integrals as a quotient of products 
of gamma functions. The primary example is the integral 


[- fo “x41 =) +1 =) Yi 9) Pda 


i<j 


in the region Re(w) > 0, Re(B) > 0, Re(y) > —Min(¢, B®) | Rel) Selberg’s 
evaluation shows that the integral is equal to 


Il rd +iyr@+ G-Dy)r6 + G- Dy) 
an rd+y)F@t+p+G+ji-Dy) — 
For our purposes a transformation is helpful. This integral can be regarded as over 
all monic polynomials of degree i with all their roots in [0, 1]. We let o1,...,0; be 
the standard symmetric functions in x;,...,x;. Then one has 


doy A doz A+++ A do; = | [(j — x7) diy A day A+++ A da. 
isi’ 


It follows that we can write Selberg’s integral as 


[roriroviomrter. 


where the integral is over the set of polynomials described above and df is the 
standard Lebesgue measure on the affine space of the f. 

We can now describe Evans’ analogue. Let g be a power of the odd prime number 
p. Let y1, ¥2, and y3 be multiplicative characters on EF, and let w be the unique 
quadratic character. Then Evans introduces 


S> mF) x2F))(@x3)(D/)) 


where D(f) denotes the discriminant of the polynomial f and the sum is taken over 
all monic polynomials of a fixed degree i. It turns out that introducing the factor w 
makes the development much smoother and it can be regarded as the analogue of 
the term —} in the exponent of D(f) in the alternative version of Selberg’s integral. 

In 1990 Anderson [1] found an ingenious and remarkably simple method for 
evaluating some of these sums. Following this first Evans [6] and then van Wamelen 
[15] extended Anderson’s method to cover all cases. Details of the results are given 
in Sect. 3. 
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Since Selberg’s integral can be considered as an extension of the beta-integral we 
can regard Evans’ sum as an extension of their analogues over finite fields, namely 
Jacobi sums. One immediate generalization of the beta-integral is the standard 
integral representation of the hypergeometric function and their generalizations. In 
[10] it was pointed out that the finite field analogues of these functions intervene 
in the theory of metaplectic forms, and especially of Eisenstein series, over rational 
function fields over finite fields. In this chapter we shall follow this line of thought 
and apply the appellation “Selberg sum” to this more general class of sum. To the 
best of my knowledge the corresponding Archimedean functions have not been 
investigated. 

The theory of metaplectic forms leads us to the evaluation to some more general 
Selberg sums. This is an aspect which we shall not go into here; some indications 
are given in [11]. He we shall take up a theme of that paper, namely a transformation 
formula. This will be given in Sect. 4. It is quite elementary but among other things 
it shows that the results of Anderson, Evans, and v.Wamelen can be used to evaluate 
further interesting classes of Selberg sums. 

It is worth noting that it is relatively easy to compute specific examples of Selberg 
sums and so one can investigate them experimentally. We shall explain why this is so 
later. This means that one can investigate the sums experimentally and, for example, 
examine their sizes in various metrics. This is a topic which we shall postpone to a 
later paper. 

My thanks are due to Ben Brubaker, Paul Gunnells, and Zeev Rudnick for helpful 
conversations about the topics discussed in this paper. 


2 Notations 


Let, as before, g be a power of the odd prime g. For f,g € F,[x] let R(f, g) be 
the resultant of R(f, g) We recall that R(f, g) is a universal polynomial (Sylvester’s 
determinant) in the coefficients of f and g. The function g +> R(f, g) depends only 
on g (mod f). Also R(f, g) is bimultiplicative and satisfies the reciprocity law 


R(g,f) = (-1I) eM R(F, 2). 


Finally R(x—a,f) = f(a). These properties determine R completely and can be cast 
into the form of an efficient algorithm based on the model of continued fractions for 
its evaluation. 

For f € F,[x] let D(f) be the discriminant of f. If f is monic one has D(f) = 
n(deg(f))R(f,f’) where n(i) = 1 fori = 0,1 (mod 4) and n(i) = —1 fori = 2,3 
(mod 4).! 


‘Note that this factor is missing from the formula (5) on [14, p. 87], an oversight which 
unfortunately led to author to similar inaccuracies in [10]. 
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We let yz be the Mobius function on F,[x]. Then Pellet’s formula (see [2, (1.5)], 
[4, Lemma 4.1], or [13]) asserts that 


uf) = (-I)""Y aD) 


where, as above, w is the unique quadratic character on F. 
Let X, be the group of characters on FP which we consider as taking values in 


Q(1="). For x € Xz let ord(x) denote the order of y. Let e, be a nontrivial additive 
character on F,; for our purposes it is convenient to regard it as taking values in 


Q(17). The choice of e, will not play any significant role in our discussion. For 
x € X, we define the Gauss sum over F, to be 


u(x) = >> x(a)eo(a). 


acky 


For 71, X2 € Xq let 


In.) = YS u@n0-a) 
aéFy 
aA0,1 
be the Jacobi sum. Then if 7:72 4 1 
TXT (Xa) = IC, x2) (11 X2) 
whereas if 7; ¥2 = 1, but x1, ¥2 # 1 then 
t(K)t(X2) = 1D 
and 
J(q1, X2) = —y1(-1). 
Additionally one has J(y,1) = —1 if y # 1 and J(1,1) = q — 2. Also 
I(x1s 2) = JQ) JO. 22) = X2(-DI(1 22)" x2) and Iq, 42) = 
n(-)DJ (a. (4142) 1). There are some further relations and properties of these 


sums which have been studied in great detail. 
Let now k = F,(x) and R = F, [x]. We define 


e:k > Q(I?) 
by 


e(f) = CF as Res, (fdx)) 
= €,(—ReSoo (fdx)). 


Here, in the sum, v runs through the finite places of k. 
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We define for y € X, and g € R — {0} the Dirichlet character of modulus g 


fe x(f/8) = x(R(8.f))- 


We define the global Gauss sum associated with k to be 


s(r.y.c)= > x(d/c)e(rd/c) 


d (mod c) 


for r € R — {0}. This can, by means of the Davenport—Hasse theorem, be evaluated 
in terms of Gauss sums over F,. If r is coprime to c one obtains 


g(r, 60) = KO) AC/o)'X(C/0)(—tQ)) 2°. 


Here c’ denotes the derivative of c. This shows that the function y(c’/c) is as subtle 
a function of c as g(r, x,c). Recall that it is equal to y(y(deg(c)))x(D(c)) for c 
monic. We therefore have for c coprime to r 


g(r, x.0) = w(c)x(n(deg(c))) x(r/0)! (—t (QD) x(D(o)). 


In the special case y = w, r = | we obtain 


g(r, @,c) = w(c)a(n(deg(c))) tw)". 


As t(@) is ./q¢ times a fourth root of 1 this gives us the Gauss evaluation of the 
quadratic Gauss sum in this context. One could also use this argument to prove 
Pellet’s formula using that evaluation, as given, for example, in [16, XIII, Sect. 11, 
Proof of Theorem 13]. 

We can now define the Selberg sums which we are going to investigate here. 
Apart from a factor +1 it specializes to the one used in the papers of Anderson, 
Evans, and v. Wamelen when we take r to be of the form x°°(x— 1)*!. For 71, 72 it is 


> 1(r/c)@(D(©)) x2(D(0)) 


where c is summed over all monic polynomials of a fixed degree i. For our purposes 
it is convenient to modify the expression by using Pellet’s formula. It becomes 


(-1I)' )3 nO) 17/0) x2(D(0)), 


or, with the reciprocity law, 


(-Dix-D8 YS > war (/) 20D). 
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Both of the sums here could be considered as to be entitled to be the more 
fundamental one. We choose 


Se(r, 11.42.) = D> wOxC/0)x2(D(0). 


c¢ monic 


deg(c)=i 


We take y1(r/c) to be zero if r and c are not coprime. By means of the Davenport— 
Hasse theorem we have then 


Ser. x1, 42.1) = (Vita) 2(n@) YS F/e)8C. 72.0). 


c monic 


deg(c)=i 
If now there exists an exponent e so that 
xi = 1 
then 


Se(r, x1, 42.8) = (-V'tQ2) (n@) D> gr. 72.0) 


c¢ monic 
deg(c)=i 


where we restrict the sum to c coprime to r. In this case one can use the theory 
of metaplectic forms to investigate these sums, see [10, 11]. At the other extreme 
x2 = Land Se(r, x1, 1,1) is then the coefficient of g~® in the L-series L(s, x1 (r/+)) 
(over R). It follows from this that if 7; (r/-) is non-principal then Se(r, 71, 1,1) = 0 
for i > deg(ro) where ro is the conductor of x; (r/-)—[I, p. 471]. 

We should note that for r of the form 074 with 6 € Fy, a € N one has 


Se(r, X1, X21) = 1 (0)'Se(ro, Xf, X25 i). 


In particular we can reduce the calculation of Se(r, 71, 72, i) to the case where r is 
monic. 


3. The Anderson—Evans-—v.Wamelen Evaluation of Selberg 
Sums 


In this section we shall summarize the results of Anderson, Evans, and v.Wamelen 
in a notation suitable for the purposes in hand. We shall take r(x) = x (x—1)*!. The 
group X, is cyclic and the notation of these three authors is based on the choice of a 
generator denoted by t. The parameters a, b, and c are related to the parameters here 
bya v= X\', and t° = Zo. It is convenient to distinguish two cases. If 
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¢* = 47" and es X{' lie in the subgroup generated by t° = y» then we can define 
two integers fo and f; by °° re = 1 and 7}' x3 = 1. It turns out that it is this case, 
the one investigated by van Wamelen in [15], is precisely the one covered by the 
theory of metaplectic forms as formulated in [11]. We shall therefore refer to this as 
the metaplectic case. The alternative, when either 7° or x{' is not in the subgroup of 
X, generated by 72 we call non-metaplectic. This was the one investigated by Evans 
in [6]. This nomenclature is by no means satisfactory, nor even accurate, but it has 
developed from my private usage and I have not found anything better to propose. 
The distinction does seem to be a useful one as we shall see below. 

In the discussion below we shall write i instead of n as used by Anderson, Evans, 
and v.Wamelen. This corresponds to the usage in Sect. 2. We shall also use n, the 
order of 72, where they used d. For m € N andj € Z we write (j),, for the least 
non-negative residue of j modulo m. 

In the theory of Selberg sums a particularly important role is played by a product 
denoted by previous authors by P,,(a, b,c) and which we shall replace by 


eo J er J j+1 7 eg-ey I-14, 
P;(€0,€1, X15 X2) = I] TOM eat Ht Je Xa) ye ane uf) ) 


0<j<i 


This product corresponds to Selberg’s product of gamma functions. It is clear that 
for a certain A one has 


Pistn(€0,€15 Xs X2) = A’ Pileo, €1, X15 X2): 


One can express A in a simple form, [6, Lemma 3.1], 


A= q"*y2(n(n))/t(X2)" in the metaplectic case 


ein 


= SF a" x2(n(n))/t(2)"_ in the non-metaplectic case. 
Note that in the latter case at least one of y{°" and y{'" is nontrivial. Our notations 
are the reason that this looks a little different from the formulation in [6]. 
It is worth-while noting that in [15, Lemma 2] van Wamelen proves the additional 
evaluation that Py, +”+41(€0, €1, X1, X2) is equal to 


+ftl 
1) 


—qgfotfn-! y (1) fotfitl —— 
q mC) x2(nfo + fi + Daye 
These evaluations are consequences of the Davenport—Hasse analogue of the Gauss— 
Legendre multiplication formula for the gamma function. 
For y € N let 


; 
T(y,q) = -y + (2k + 1)(-q)* 


k=0 
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and 


S(iy,q) =1-(U—@y. 


From now on we shall assume the n > | as Anderson’s argument has to be modified 
in the exceptional case n = 1. For more details see Sect.5. The main result of the 
three authors is that in the non-metaplectic case 


Se — 1)", x1, X2,) 
is equal to 
ya (-1)!(-1)'Pio, €1, 21; X2) 


and in the metaplectic case it is equal to 


yi(—1)"(-Digh*l P,(eo, e1, Ni: X2) 


T(2[4].q) if On < Min(fo.A) < Max(fo.fi) < Go t+ fi -—i+ Dn 


T(2[4] + 1.9) if +A —i+ In < Min. fi) < Max(fo.fi) < @n 


S(+] +1,q) _ otherwise. 
We note here, following [15, Sect. 4], the following summations, 


U-(q,X) 


2 Pom Oe = Ta FH) 


m>0 


where 
Ue(q, X) = 2X7q* — gX* —3gX+X4+1, 
Ze + 1,q)x" = a og 
where 
Uo(q.X) = X?q? + PX — 3gX—q +2, 
and 
S © Stim, q)X” = are 


m>0 
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We note also that U.(q,q-*) = (1 — q7!)3, U-(q,1) = 2(¢—- 1)*, Ug.) = 
—q( — q7')? and U,(q,1) = 2(¢q — 1)?. These results mean that in the non- 
metaplectic case and in any complex embedding we have 


> S(x(x = 1)", X15 X25 i)X(—%0)/n 
i20 
1=l1g 

is equal to 


Wi IPT Pea tisgas 2) = as) 


where we assume that 0 < ig < n and A is as above. Note that |A| takes on, ifn 4 1, 


n 1 
5-1 gh-3 


one of the three values g?~!, g?~? or g? in any complex embedding. If n = | then 


1 7 . 
|A| takes on one of the three values 1, g2 or g in any complex embedding. 
In the metaplectic case we find that, according to the three cases above, the series 
is equal to 


Pi (€0, €15 X15 X2)Ue(q, X1(—D)*"(—1)"AX) 

(1 = yi(—1)e"(—1)"gAX)? (1 = ya (—1)1"(- 1)" AX)’ 
Pip (€0,€1, X15 X2)U0(G, XCD) ""(— I)" )AX 

(1 = ya(—1e"(—1)"gAX)? (1 = ya (—1) "(1)" AX)’ 


n(-D(-1)" 


pS Goal a a 
or 


Pig (Co, €1, X15 X2) + (4 = 2x1 (—1)"(—1)"gAX)? 


—])e10 (1) 
m(-1)1"(-1) (1 — yu (—D*"(—1)*gAX/? 


respectively. Note that the nature of the singularities reflect the type of the Selberg 


sum. As we have assumed that n > 1 we have in the metaplectic case |A| = q2~. 


4 The Transformation Formula 


We shall now turn to a property of Selberg sums analogous to Theorem | of [11]. 
We need some preparations in order to be able to formulate the result. Let n be the 
order of 72 and let n’ be the order of y;. Let yo be such that y; and re are in the 
group generated by 7o. Let a > 0 and b = 0 be such that 


11 = XO 12 = Xo- 
Let x monic and a prime in R = F, |x]. If |r then 


Se(x" r, 1, X2,0) = Se(r, x1, X2, 4). 
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If z Jr this is no longer true. We have that 
Se(r, x1, X2.i) — Se(x" r, 11, X21) 


is equal to 


Yo Hle®) x1(7/e17) x2(D(17)). 


deg(c)z)=i 
Since D(c,x) = D(c1)D()R(c1, 1)* this becomes 
—xr/m) 20D) YY wer) x14 e178) x2(7/€2)" x2(D(C1)) 
deg(c| )=i—deg(z) 


or 


—y1 (1/1) x2(D(z)Se(r*x”, Xo, X2, i — deg(z)). 


We regard this as a stability property of Selberg sums. 


Theorem 1 Let a, 6, y,5 € F, be such that A = a5 — —By # 0. Then for suitable 
integers M, M' we have that 


fe ax +B a 1 b(i-1)+M 
ae se(-(245) (Gp) ee 


is equal to 
Se(r(a)(—yx + ay", x1, X2,9) 


If y = 0 we can take M,M’ = 0. If y 4 0 then M is to such that ee = land 
M + b(i— 1) > a-deg(r), and M' is such that M' > 0 and x" = 1. 


Proof The proof is carried out by verifying the identity for 6 i) of the 
va 


form é ‘ ), of the form & ) and equal to (? .) and then combining these in 


the usual manner. 

The first two cases are straightforward; we need to only replace c in the sum 
defining the Selberg sum by c(x + A) and c(@x)0~ where 6 = a/6 respectively. 

In the case G i) — G a) we assume that r(0) = 0; this is the reason for 

Y 

the introduction of the parameter M’. This being so c is not divisible by x and so 
c(0) # 0. We replace c in the sum by c(x!)x‘/c(0) and recall that R(x, c) = c(0). 
The formula quoted now follows from the theory of resultants. 
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5 Selberg Sums and Metaplectic Groups 


The results of Sect.3 show that there is a large number of relationships between 
Selberg sums. The results of Anderson, Evans, and v. Wamelen show that polyno- 
mials with two rational zeros over F, can be evaluated explicitly. It turns out, as we 
shall see in the next section, that if the zeros are no longer rational a similar formula 
holds. The next case which one can investigate, as in [11], is that of polynomials 
with three rational zeros. In view of Theorem | these can be brought to the form 
x(x — 1)" (x—A)®. As in [11] there are a large number of relations between these, 
now a little more complicated as we have to move into the region of stability. They 
are the analogues of the relations between such sums similar to transformations of 
hypergeometric functions (cf. [17, Chap. XIV]). We should note that in this standard 
form such Selberg sums can be easily computed as the resultant can be evaluated 
by a continued-fraction type of recursion. This is implemented in, for example, the 
gp/PARI package. I hope to discuss the results of these calculations in a future paper. 

There are some further relations that are of interest. Let n,n’ be as in Sect. 3. If 
n’'|n then the generating series 


> Se(r, X1> X25 i)TOo)/ 


i=ig (mod n) 
i=0 


which converges in |T| < q~”, is, for each ig : 0 < i < n, a rational function in T. 
There is at most one singularity in |T| < q~”/? and this, if it exists, is simple and is 
located at T = q7! (—t(x5!))". These are the series investigated in [10] and [11]. 
The determination of the residue and the establishing of its properties is one of the 
major questions addressed in those papers and it is one that has, as yet, only been 
partially answered. 

If the condition n’|n is not satisfied then the series above can also be investigated 
by means of the theory of Eisenstein series. The point here is that one has to use 
an Eisenstein series of a “Nebentypus” depending on y; and 72. The general theory 
in contained in [8]. In fact in this case the constant terms of the Eisenstein series 
will be made up from holomorphic L-series. The analytic continuation then follows 
from the “principle of the constant term”; see [9, Theorem 4.8] for a statement and 
[7, Theorem 1.6.6] for a proof in the function-field case. The details have not been 
given explicitly but it seems very plausible that in this case one will be able to 
conclude that the series above also in this case represents a rational function but 
one now with no singularities in |7| < q~"/*. One expects that one will be able to 
determine the denominator and to estimate the degree of the numerator which are 
known in the metaplectic case also in the non-metaplectic case. 

It is instructive to examine the special case n = 1. There are two cases, the 
metaplectic case in which 7;(r/-) is a principal character and the other case when 
xi(r/-) is non-principal. In the first case we see that >.) Se(r, x1, X2,i)T' is 
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L(s, x1(r/-)~! with T = q~°. Let ry be the modulus of y;(r/-); then this series 
is equal to 


( _ yy [Ja _ gq ems) 


aro 


In this case the generating series is a polynomial. 

If x, (r/-) is non-principal then, if we suppose for the moment that the 7, (r/-) is 
primitive. Then Anderson, [1], has observed that L(s, x; (r/-) is a polynomial in q~* 
of degree deg(ry) — 1. It follows that L(s, x, (r/-)~! has this number of singularities, 
counted with multiplicities. They lie on g~* = q~'/? by Weil’s theorem. If the 
character is not primitive then there is a numerator of the form [](1 — q°¢8)) 
where the product is over all the primes dividing the modulus but not the conductor. 
In the case of the standard Selberg sums the degree of zr is | and it is of the form 1 — 
aq * where a is a Jacobi sum. The cases of other r will be much less straightforward. 

We return to the general case. The singularities of the generating series of 
the sequence Se(r, 1, X2,i) are given as certain sums of their coefficients, i.e., 
of Selberg sums. This has been described in [10, 11]. The theory of metaplectic 
groups then leads to a number of relations between the residues, the so-called Hecke 
relations. A consequence of this is that there are a number of unexpected relations 
between Selberg sums. Some examples are given in [10, Sect. 4]. To date there is no 
systematic method of treating these relationships. This is not unlike the situation in 
the theory of generalized hypergeometric functions. 

It is interesting to consider Anderson’s method in this connection. His approach 
is based on a formula which is derived from the theory of Dirichlet series over R. 
We shall now sketch his technique. Let f be a monic polynomial in R. Let y be as 
before and let n be its order. We regard y(f/-) as a Dirichlet character. Let f, be the 
conductor of f. This is [| where z runs through those monic prime divisors of 
f whose order in f is not divisible by n. We shall assume that f, 4 1. Then by [1, 
Proposition 2.1] 


ODE) Yee |] eG) 


1\fo 
is equal to 
Y> XCF/8). 
gideg(g)=deg(f,)—1 
g monic 


This is proved by using the functional equation for the L-function L(s, x(f/-)) and 
the fact that the latter is a polynomial to determine the coefficient of (q~*)*)—!. 
Anderson proves this in the case n = q— | but the proof is valid whenever x(f/-) is 
not principal. Anderson achieves the same generality through his parameter c. 
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Anderson’s proof of the formula for the Selberg sum exploits the evaluation of 
a double sum in two different ways. The argument is strongly reminiscent of the 
multiple Dirichlet series technique, as used, for example, in [3]. The crucial point of 
the argument is that r, should be quadratic and it follows that one can also evaluate 
Selberg sums explicitly when r is a power of an irreducible quadratic polynomial. 

It seems plausible that one could use Anderson’s method combined with 
elementary considerations to determine the properties of metaplectic Eisenstein 
series over rational function fields. In fact the “principle of the constant term” 
referred to above is relatively elementary and as the constant term is easy to study 
in this case even the Eisenstein series approach is relatively elementary. The method 
of multiple Dirichlet series is, at least for higher ranks, an important component in 
the study of Eisenstein series and so the applicability of both methods should not be 
surprising. At any rate it should reassure those unfamiliar with this theory that the 
method is, at heart, in this case at least, elementary. 
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Polignac Numbers, Conjectures of Erd6és 
on Gaps Between Primes, Arithmetic 
Progressions in Primes, and the Bounded 
Gap Conjecture 
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Dedicated to the memory of Professor Wolfgang Schwarz 


Abstract In the present work we prove a number of results about gaps between 
consecutive primes. The proofs need the method of Y. Zhang which led to the proof 
of infinitely many bounded gaps between primes. Several of the results refer to the 
so-called Polignac numbers which we define as those even integers which can be 
written in infinitely many ways as the difference of two consecutive primes. Others 
refer to several 60-70 years old conjecture of Paul Erdés about the distribution of 
the normalized gaps between consecutive primes and about the distribution of the 
ratio of consecutive primegaps. The methods involve an extended version of Zhangs 
method, a property of the GPY weights proved by the author a few years ago and 
other ideas as well. 


Keywords Gaps between primes ¢ Polignac conjecture * Polignac numbers 
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1 Introduction 


1.1. Recently Yitang Zhang proved the Bounded Gap Conjecture, a term formulated 
in recent works of mine, often in collaboration with D. Goldston and C. Yildirim. 
Let p, denote the nth prime, P the set of all primes and 


dn = Pn+1 — Pn (1.1) 


the nth difference between consecutive primes. 
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Theorem A (Zhang [28]) liminfd, < 7- 10’. 
noo 


Earlier, in a joint work with D. Goldston and C. Yildirim we showed [11] this 
under the deep unproved condition that primes have a distribution level greater than 
1/2, in other words, under the assumption that the exponent 1/2 in the Bombieri— 
Vinogradov Theorem can be improved. 


Theorem B (Goldston et al. [11]) Ifthe primes have a distribution level 0 > 1/2, 
then with a suitable explicit constant C(}) we have 


lim infd, < C(#). (1.2) 
noo 


If the Elliott-Halberstam Conjecture (EH) is true, i.e. 0 = 1 or, at least, } > 0.971, 
then 


liminfd, < 16. (1.3) 


n—>Co 


We say that # is a level of distribution of primes if 


Ss yo 1 a = 1.4 
max daar KA (ozX) (1.4) 


q<x9—¢ (4,.9)=1|p=a(mod q) 
psx 


for any A,é > 0. The Bombieri—Vinogradov Theorem asserts that } = 1/2 is a 
level of distribution and the Elliott-Halberstam Conjecture [4] asserts that } = 1 is 
also an admissible level. 

Somewhat later, in a joint work with Motohashi [19] we showed that in order to 
reach infinitely many bounded gaps between primes it is sufficient to show (1.4) for 
smooth moduli g, i.e. with moduli without any prime divisor > X?, where b can be 
chosen as an arbitrarily fixed constant. 

Zhang proved that (1.4) holds for smooth moduli with a } > 1/2 and 
consequently obtained Theorem A. (We have to mention that he did not mention 
our work [19] in his paper at all despite our notice, instead he published his proof of 
our result.) 

The present paper is a modified version of the author’s preprint of 27 May 
2013 (available at arXiv:1305.6289v1) written before the alternative proofs of the 
Bounded Gap Conjecture by Maynard [18] and Tao (unpublished) and it contains 
several results which can be reached by using (1.4) for smooth moduli with any 
v > 1/2 and among others ideas of the author from [20]. Several of these 
results (sometimes in modified forms) were mentioned in the author’s lecture in 
Hildesheim, in July of 2014. 

We mention that the method of Maynard—Tao led later to several similar results 
as our present ones (often coupled with other ideas, and often in stronger forms as 
well, although they might be weaker in some other sense). 
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2 History of the Problems: Formulation of the Results 
2.1 Approximations to the Twin Prime Conjecture 


The problem of finding small gaps between primes originates from the 


Twin Prime Conjecture lim infd, = 2. 
n—->oo 


Since by the prime number theorem the average gap size is logn, Hardy and 
Littlewood considered first already in 1926 in an unpublished manuscript (see, 
however [23]) the upper estimation of 


dn 
A, = liminf (2.1) 
n—>oo logn 


and showed A, < 2/3 under the assumption of the generalized Riemann hypothesis 
(GRH). 
Erdés [5] was the first to show unconditionally 


Ay 21 =e, eg 20 (2.2) 


with an unspecified but explicitly calculable positive constant co. The full history 
with about 12—15 improvements concerning the value of A, is contained in [11], so 
we list here just the most important steps: 


A, < 0.4666 (Bombieri and Davenport [1]), (2.3) 
A, < 0.2485 (Maier [17]). (2.4) 
This was the best result until 2005 when we showed in a joint work with D. 
Goldston and C. Yildirim what we called the small gap conjecture: 
Theorem A (Goldston et al. [11]) A; = 0. 
Soon after it we improved this to 
Theorem B (Goldston et al. [12]) We have 


dy 
lim inf ———_—_______ < 2.5 
n>oo (logn)!/2(log log n)? iad 
Finally a few years ago I improved the exponent to 3/7 and announced the result 
at the Journées Arithmétiques, Vilnius, 2011, and Turan Memorial Conference, 
Budapest, 2011. 


370 J. Pintz 


Theorem C (Pintz [22]) 


dy 
lim inf ———_———__”. < ow. 2.6 
n00 (log n)?/7 (log log n)4/7 si 2%) 
In a joint work with Farkas et al. [9] we also showed that essential new ideas are 


necessary to improve (2.6). 


2.2 Polignac Numbers 


The twin prime conjecture appeared already in a more general form in 1849 in a 
work of de Polignac. 
First we give two definitions. 


Definition D A positive even number 2k is a strong Polignac number, or briefly a 
Polignac number if d, = 2k for infinitely many values of n. 


Definition E A positive even number 2k is a weak Polignac number if it can be 
written as the difference of two primes in an infinitude of ways. 


The set of (strong) Polignac numbers will be denoted by D,, the set of weak 
Polignac numbers by D,,. (We have trivially D, C D,,.) 


Polignac’s Conjecture (de Polignac [2]) Every positive even integer is a (strong) 
Polignac number. 


Since the smallest weak Polignac number has to be a strong Polignac number, an 
easy consideration gives that using |A| (or sometimes #A) to denote the number of 
elements of a set A, the following proposition is true. 


Proposition The following three statements are equivalent: 


(i) the bounded gap Conjecture is true; 
(ii) there is at least one (strong) Polignac number, i.e. |D,| > 1; 
(iii) there is at least one weak Polignac number, i.e. |D,,| > 1. 


The above very simple proposition shows that the bounded gap conjecture 
itself leaves still many problems open about Polignac numbers or weak Polignac 
numbers, but solves the crucial problem that their set is nonempty. We will prove 
several unconditional results about the density and distribution of (strong) Polignac 
numbers. 


Theorem 2.6 There exists an explicitly calculable constant c such that for N > 
No we have at least cN Polignac numbers below N, i.e. Polignac numbers have a 
positive lower asymptotic density. 


Theorem 2.7 There exists an ineffective constant C’ such that every interval of type 
[M,M + C’] contains at least one Polignac number. 
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Remark As mentioned earlier, the term Polignac number means always strong 
Polignac numbers. 


2.3. The Normalized Value Distribution of d, 


The prime number theorem implies 


1 dy 
lim — 2 =1, (2.7) 


so it is natural to investigate the series d,,/ log n. Denoting by J the set of limit points 
of d,/ logn, Erdés [7] conjectured 


_ dh = hee (2.8) 


While Westzynthius [27] proved more than 80 years ago that oo € J, no finite 
limit point was known until 2005 when we showed the small gap conjecture, i.e. 
Theorem A [11], which is equivalent to 0 € J. 

Interestingly enough Erdés [7] and Ricci [24] proved independently about 60 
years ago that J has a positive Lebesgue measure. What I can show is a weaker form 


noo 


of Erdés’s conjecture (2.8). 


Theorem 2.8 There is an ineffective constant c > 0 such that 
[O,c] CJ. (2.9) 


Kalman Gyory asked me at the Turan Memorial Conference in Budapest, 2011 
whether it is possible to find a form of the above result which gives answers about 
the more subtle value-distribution of d, if we use a test-function f(n) < logn, 
f(n) > wasn o. 


Definition 2.9 Let F denote the class of functions f : Zt — Rt with a slow 
oscillation, when for every ¢ > 0 we have an N(e) > 0 such that 


(d-—e)f(N) <f@ <U+e)f(N) for N<n<2N, N>WN(e). (2.10) 


Theorem 2.10 For every functionf € F, f(n) < logn, lim f(n) = co we have an 
noo 


ineffective constant cp > 0 such that 


0. cf] C Jp i= | (2.11) 
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2.4 Comparison of Two Consecutive Values of d, 


Erdés and Turan proved 65 years ago [8] that d,41—d, changes sign infinitely often. 
This was soon improved by Erdés [6] to 


dn * dn 
liminf + <1 <limsup . (2.12) 


Oe n noo n 


Erdés [7] wrote 7 years later: “One would of course conjecture that 


ost F dn41 
lim inf =0, limsup — =o, (2.13) 


R70 n noo n 


dn41 


but these conjectures seem very difficult to prove.” 
In Sect. 6 I will show this in a much stronger form: 


Theorem 2.11 We have 


dn dn 

tim ing 2! (2.14) 
noo (logn)—! 

and 
jngap tl So, (2.15) 
n—>oo logn 


2.5 Arithmetic Progressions in the Sequence of Generalized 
Twin Primes 


Based on the method of Vinogradov [26] van der Corput [25] showed the existence 
of infinitely many 3-term arithmetic progressions in the sequence of primes. The 
problem of the existence of infinitely many k-term arithmetic progressions was open 
for all k > 4 until 2004, when Green and Tao [14] found their wonderful result that 
primes contain k-term arithmetic progressions for every k. I found recently [20] 
a common generalization of Green—Tao’s result and our Theorem B [12] under 
the deep assumption that there is a level * > 1/2 of the distribution of primes. 
Combining Zhang’s method with that of [20] I can prove now the following: 


Theorem 2.12 There isad < 7x10! such that there are arbitrarily long arithmetic 
progressions of primes with the property that p’ = p + d is the prime following p 
for each element of the progression. 
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3 Preparation for the Proofs 


All the proofs use some modified form of the conjecture of Dickson [3] about k- 
tuples of primes. (His original conjecture referred for linear forms with integer 
coefficients.) Let H = {hi}, be a k-tuple of different non-negative integers. We 
call H admissible, if 


k 
Px(n) = | [hi (3.1) 
i=1 


has no fixed prime divisor, that is, if the number v,(7) of residue classes covered 
by H mod p satisfies 
vp(H) <p for pe P. (3.2) 


This is equivalent to the fact that the singular series 


-k 
6(H) =|] (1 - “) (: = ~) >0. (3.3) 


: P P 


Dickson conjectured that if H is admissible, then all n + h; will be primes 
simultaneously for infinitely many values of n. 

Hardy and Littlewood [16], probably unaware of Dickson’s conjecture, formu- 
lated a quantitative version of it, according to which 


x 


mu(x)= >> 1=(6(H)+0()) (3.4) 


n<x 
n+h,€P (1Si<k) 


log* x’ 


In the work [11] we attacked (but missed by a hair’s breadth) the following 
weaker form of Dickson’s conjecture which I called 


Conjecture DHL (k, 2) If H is an admissible k-tuple, then n + H contains at least 
two primes for infinitely many values of n. 


It is clear that if DHL(k, 2) is proved for any k (or even for any single k-tuple 
H,), then the Bounded Gap Conjecture is true. DHL(k, 2) was shown very recently 
by Zhang [28] for k > ky = 3.5 10° and this implied his Theorem A, the infinitude 
of gaps of size < 7-10’. 

However, results of this type cannot exclude the existence of other primes and 
therefore give information on numbers expressible as difference of two primes, in 
the optimal case of Zhang’s very strong Theorem A prove the existence of many 
weak Polignac numbers. However, they do not provide more information about 
(strong) Polignac numbers than the very deep fact that D, # @ and they do not help 
in showing any of Theorems 2.6—2.12. For example, in case of Theorem 2.12 they do 
not yield, let say, 4-term arithmetic progressions of primes and a bounded number d 
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such that p + d should also be prime for all four elements of the progression (even 
if we do not require that p and p + d should be consecutive primes). 

By a combination of the ideas of D. Goldston, C. Yildirim, Y. Zhang and mine, 
I am able to show a much stronger form of Conjecture DHL(k, 2) which may be 
applied towards the proof of Theorems 2.6—2.12. In case of Theorem 2.12 the 
ground-breaking ideas of Green and Tao [14] have to be used too, of course. 

In view of Zhang’s recent result the stronger form of Conjecture DHL(k, 2) (to 
be proved below) will be called Conjecture DHL* (k, 2). 

Let P” (m) denote the smallest prime factor of m. 


Conjecture DHL* (k,2) Let k > 2 and H = {h;}‘_, be any admissible k-tuple, 
N € Zt, € > O sufficiently small (¢ < €0) 


k 
HC[0,H], H<elogN, Px(n)=[|[(n+h). (3.5) 


i=1 


We have then positive constants c)(k) and c2(k) such that the number of integers 
n € [N,2N) such that n + H contains at least two consecutive primes and almost 
primes in each components (i.e. P~ (P3,(n)) > n“!) is at least 


c2(k)G(H) for N > No(H). (3.6) 


log* N 


One can see that we have a looser condition than in DHL(k, 2) as far as the 
elements of H are allowed to tend to infinity as fast as ¢ log N. On the other hand, 
we get stronger consequences as 


1. we can prescribe that the two primes n + h; and n + h; in our k-tuple should be 
consecutive; 

2. we have almost primes in each component n + h;; 

3. we get the lower estimate (3.6) for the number of the required n’s with the above 
property. 


The condition n € [N, 2N) makes usually no problem but in case of the existence 
of Siegel zeros some extra care is needed if we would like to have effective results 
(see Sect. 8 for this). 

After this it is easy to formulate (but not to prove) our 


Main Theorem Conjecture DHL*(k, 2) is true for k > 3.5 x 10°. 


Proof Since this result contains Zhang’s Theorem and even more, it is easy to guess 
that a self-contained proof would be hopelessly long (and difficult). We therefore 
try to describe only the changes compared to different earlier works. 

The first pillar of Zhang’s work is the method of proof of our Theorem B. 
Although he supposes 1 as a constant the method of proof of Theorem B (see 
Propositions | and 2 in [11]) allows beyond H < log WN (required by (3.5) above) 
the much looser condition H « N!/4~* for any e > 0. 


Polignac Numbers, Conjectures of Erdés on Gaps Between Primes 375 


The second pillar of Zhang’s work is to show that distribution of primes 
according to non-smooth moduli, i.e. with some prime divisor > N? for any fixed 
small constant b, can be neglected. As mentioned in the Introduction we showed this 
already much earlier in a joint work with Motohashi [19]. This work also supposed 
H to be a constant but the only place where actually more care is needed in the proof 
is (3.11) of [19]. On the other hand, allowing here the condition 


H < logN, (3.7) 
the same simple argument as in Sect. 6 of [11] adds an additional error term 
k’ logloglogR < logloglogR (3.8) 


to the right-hand side of (3.11) of [19] which is far less than the present error 
term log Ro = log R/(log log R)°. Otherwise the proof works without any change, 
everything remains uniform under our condition (3.7) above. 


Remark The crucial Lemmas 3 and 4 of our work [19] contain an additional 
factor y(n, #1). However, by the definition (4.17) of [19] we have y(n,H) = 1 
if P~ (Px(n)) > R" for any fixed 7 > 0. In such a way the extra factor y(n, H) 
does not affect the validity of our Lemma 3.2 below since the asymptotic provided 
by Lemma 3.3 for the right-hand side of (3.11) (and similarly the analogue of it 
for primes, Lemma 3.4) is the same as if we used the constant weight 1 instead of 
y(n, H). 


No change is required in the third pillar of Zhang’s work where he proves some 
sort of extension of the Bombieri—Vinogradov theorem for smooth moduli and the 
residue classes appearing by the method of Theorem B. 

However, the proof of Theorem DHL*(k, 2) (for k large enough, k > 3.5- 10°) 
requires a further important idea, namely Lemmas 3-4 of the author’s work [20]. 
This we formulate now as 


Lemma 3.1 Let N° < R < /N/p(logN), p € P, p < R® with a sufficiently 
small positive co and sufficiently large C. Then we have with the notation 


1 R\tt? 
Ar(n;H,k + £) = &+ 5! ~ H(d) (ioe *) (€ <k) (3.9) 


d<R,d|P3(n) 
the relation 
lo 
Yl ARH O? Kj EP Agi Hk + 0. (3.10) 
plogR 
n€[N,2N) n€[N,2N) 


PIP (n) 


Lemma 3.1 immediately implies 
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Lemma 3.2 Let N° < R < N'/2*”(logN)~©, 1 > 0. We have then 


Si Anns Hek +O? Ken YO AnH. k + 6). (3.11) 


n€[N,2N) n€[N,2N) 
P~ (P3,(n))<R" 


Remark Lemmas 3.1 and 3.2 were already proved in [20] under the loose condition 
H <logN. 


Lemma 3.2 asserts that numbers n where Py(n) has a prime factor < R” 
(equivalently < N”) with a small enough value of 7 (or b) might be neglected, since 
the weight used in all proofs is actually of type (3.9). The value of 7 (or b) depends 
on k. 

These results play a crucial role in the common generalization of the Green— 
Tao theorem and of our Theorem B (cf. [20]) and also in the proof that prime gaps 
< € log p form a positive proportion of all gaps for any ¢ > 0 (proved in a joint work 
with Goldston and Yildirim). 

These four pillars lead finally to the stronger form of Theorem DHL* (k, 2) if we 
combine it with a standard assertion following from Selberg’s sieve, which we can 
formulate in this special case as 


Lemma 3.3 Let 0 < a < 1/2 be any constant. Then 
Na-* 
yo 1«& = 56H). (3.12) 


k 
n€[N,2N) log N 
P (P34(n))>N® 


Proof This is Theorem 5.1 of [15] or Theorem 2 in Sect. 2.2.2 of [13]. This is also 
valid if we assume only H < log N. 


We further need a generalization of Gallagher’s theorem proved by the author 
which we formulate as 


Lemma 3.4 Let H, be an arbitrary admissible k-tuple with 


H = Hy S [0, A]. (3.13) 
Then we have for any n > 0 
1 4 GUN) 
Su) = 2 Ce 1+ O(n) (3.14) 
if 
H > exp(k!/"), (3.15) 


Proof This is Theorem 1 of [21]. 
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Remark As it is easy to see Lemma 3.4 implies Gallagher’s classical theorem [10] 
on the singular series. 


Combining the proofs of Theorems A and B (in the modified forms mentioned 
above) with the assertion of Lemma 3.2 we obtain under the weaker condition (3.6), 
i.e. for all admissible k-tuples Hy = {hi}, hi < his, 


H = Hx © (0, clog N], (3.16) 
at least 
N 
kK)G(H 3.17 
a SOT (3.17) 


numbers  € [N, 2N) such that n+ contains at least two primes and almost primes 
in each components, i.e. 


P-(Px(n)) > n°, (3.18) 


However, we must show the same with consecutive primes as well. 
We can define for any subset V of {1,2,..., k} the set 


VIN) = {n€[N,2N): n+heP oie V3. (3.19) 


Since k is bounded the number of possible subsets V is also bounded, therefore we 
can choose a Vo such that 


Vo C {1,2,...,4}, [Vol = 2, [Vo(W)| = c3(K)G(H) (3.20) 


log N" 


Choosing two arbitrary consecutive elements i,j € Vo(N) with i < j we have at 
least 


N 


numbers n € [N, 2N) such that for any pu € (i,j) 
nt+h, €P. (3.22) 


We have to assure, however, additionally that for a positive proportion of these 
numbers n we have also 


nt+h€éP forh€H, hi<h<h. (3.23) 
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Applying Lemma 3.3 for all these values h and summing up we arrive at the 
conclusion that 


ST yy Us COW a es G(HU h) (3.24) 


né[N,2N) h; ‘ NV njcheh; 
cy (k) <A< Lj 
P~ (Pog (n)) >No Bi 


— 


< 2C4() FN 


by Lemma 3.4. This shows that if ¢ was chosen sufficiently small depending on 
k, i.e. 


€ < €0(k) (3.25) 


then our original primes n + h; and n + hj are consecutive for at least 


(3.26) 


elements n € [N, 2N), thereby showing our Main Theorem. 


4 Polignac Numbers 


Proof of Theorem 2.6 
The fact that DHL*(k, 2) implies the positivity of the asymptotic lower density 
of Polignac numbers is expressed as Corollary | of [20] with the value 


1 1 
kk = pil ( 3) = 


and proved in Sect. 11 of [20]. The above value is about e~”/(k? logk) for large 
values of k. (If k —> oo, they are asymptotically equal.) 

Proof of Theorem 2.7 

We again suppose DHL*(k, 2) for k > ko. 

The reasoning is more intricate in this case and the resulting value C is 
ineffective. The trivial relation 


hy —hy = (k— 1) min(h; — hj) (hi < hiti) (4.2) 
I>] 
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even gives the impression that the best we can prove about localization of Polignac 
numbers is an interval of type 


[M, (k —1)M]. (4.3) 


Let us suppose that Theorem 2.7 is false. Then we have for any Cp > 0 an infinite 
series of intervals 


I,:=[M,,M,+C,], My >Cy,>4My-1, Mi > Cp (4.4) 


such that 


D.n (U 1) = 6. (4.5) 


For p > k we have clearly 
vp(Hx) < p for p € P, (4.6) 


so we have no problem of choosing an admissible system H, in a sufficiently long 
interval (e.g. if C, is large enough). Let 


He = (yYoars ty € 0, = [My + C)/2,M, + O], 42 
For h, € ly, hy € I, v < p we have then 
hy — hy € [My + Cy/2— 2My-1,My + Cy] C Ih. (4.8) 
Since in case of k > ko at least one of the numbers 


h,-hy (<v<p<h (4.9) 


can be written as a difference of two consecutive primes, (4.8) contradicts to (4.5) 
and thus proves our theorem. 


5 The Normalized Value Distribution of d,, 


Proof of Theorems 2.8—2.10 

Since Theorem 2.10 implies Theorem 2.8 it is sufficient to prove the latter. The 
structure of the proof will follow that of Theorem 2.7 proved in the previous section. 
Suppose that Theorem 2.10 is false. In this case we have for a sufficiently small 
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c* > 0 an infinite series of intervals 
Iu = leuty + dy|y ye 4b, > Weai, cr ec" (5.1) 


such that for K large enough 


le.e) K 
ot. A (U a) = @, where N = N(K) > 0. (5.2) 
Let 
La) :=lef@),G+te6y@|. v= 1,2)....k. (5.3) 


Then we have 


K 
dn ¢ |_J h(n) for v = 1,2,...,K, n €[N,2N), N>N(W). (5.4) 


v=1 


Using our Main Theorem, similarly to (4.7) we can construct an admissible k-tuple 
Hy, = ths with 3.5- 10° < k < K such that hy > hy > --- > Ay and witha 
sufficiently small e > 0 


hy, € T,(N) = | (« + >) ad + e)f(N), (cy + by) = erro) : (5.5) 


For hy € La hy € I’, w < v we have for N > max(N(K), No(é)) 


vy? 


5 
fy —hy € | (<» Sr 2ey1) (1 + e)f(), (cu + 8,)(1 — e)f(N) | == FEN). 


(5.6) 
Since we have for any 1 < yu < v < ko and anyn € [N, 2N) 


Ie(N) C In), (5.7) 


the fact that by the Main Theorem h,, — h, = d, for somen € [N,2N), 1 < p < 
v < ky contradicts to (5.4) and thus proves Theorems 2.8 and 2.10. 


6 Comparison of Two Consecutive Values of d,, 


Proof of Theorem 2.11 
Since the proof of the two inequalities is completely analogous, we will only 
prove the second one. The basis of it is the Main Theorem proved in Sect. 3. We can 
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start with an arbitrary admissible k-tuple H = Hy = {hi}4,, hi < ty < +++ < My 


with k > 3.5- 10°. Let with a fixed sufficiently small cj (k) define 


B(i,j,N) = {n<N,n+h,€P.nt+h € P,P (Py(n)) > nO, (6.1) 


|B(i, j, N)| log’ N 


W >0 (6.2) 


T = 3(@/J);j > i, lim sup 
N->0oo 


and let us choose the pair (i, /) with maximal value of j, afterwards that with maximal 
value of i < j. Then for any h, € (hj, hj) (.e.i < pu <j) we have clearly 


i |B(u.j,N)|log*N 
im sup ———————_ = 


0 (6.3) 
N->0oo N 


so all components n + h, between n + h; and n + h; are almost always composite if 
n € B(i,j,N) and N = N, — oo through a suitable sequence N,. 
On the other hand, if we have an arbitrary h € (hj, hj), h ¢ H, then the assumption 
n+heéP implies forHt =HUh 
P-(Py+(n)) > n®, (6.4) 


However, by Lemma 3.3 the number of such n < N is for all N 


G(HU {hy)N G(H)N log hy 
“Kker Toghrly Sher yg gkt ly “Sheu ao eTay Ae 


This together with (6.3) shows that we have at least 


N 
(ci(k, H) + oD) Hy (6.6) 


values n < N with n + hj, n + h; being consecutive primes for some sequence 
N=N,->-o@. 
Let us consider now these differences. Let 
n+h=pyeP, nt+h=p4i€P, d=h-h <1 (6.7) 
where 


logn ~ logv ~ logN. (6.8) 


Suppose now that the second inequality of Theorem 2.11 is false. Then we have 
for all those values of v with an arbitrary e > 0 for N > N(e) 


dy41 <d,elogN < CelogN, C= —hy. (6.9) 
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The already quoted sieve of Selberg (Lemma 3.3) gives an upper estimate how often 
this might happen for any particular value 


dy4, =d < Celogn. (6.10) 


Adding it up until Ce log N we obtain at most 


Ce log N 


N 
Se age 2 2 GCHUh) Key SH) IN’ (6.11) 


by Lemma 3.4. This means that in view of (6.6) this cannot hold for all N = N,—> oo. 
This contradiction proves Theorem 2.11. 


7 Arbitrarily Long Arithmetic Progressions of Generalized 
Twin Primes 


Proof of Theorem 2.12 

In this case we have to use again our crucial Main Theorem and the rest of 
the machinery executed in Sect. 7 of [20]. This yields the combination of Zhang’s 
theorem with that of Green and Tao [14]. 


8 How to Make Zhang’s Theorem Effective? 


Our last point is that in its original form Zhang’s theorem is ineffective since it uses 
Siegel’s theorem. His result and similarly to it all results of the present work can be 
made effective in the following way. 

According to the famous theorem of Landau—Page there is at most one real 
primitive character y with a modulus g (and the characters induced by it) which 
might cause ineffectivity in the Bombieri—Vinogradov theorem, and if it causes 
ineffectivity, then this modulus has to satisfy 


(log X)* < q < (log x)? (8.1) 


for any w(X) — co as X > oo. This modulus can cause any problem only in the 
case 


q| Px(n). (8.2) 
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However, our Lemma 3.3 says that we can neglect all numbers n with 
P-(Pu(n)) <n, (8.3) 


so (prescribing additionally q + d in the definition (3.9) of Ar(n;H,k, £)) both 
Zhang’s theorem and all our present results become effective. 


Remark It is an interesting phenomenon that in the sieving process yielding 
bounded gaps between primes and in all our present results we can choose 


Ag=0 (8.4) 
if d has either a prime divisor 

> New (8.5) 
or if it has a prime divisor 

<NO®, (8.6) 
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Idempotents and Congruence ax = b (mod n) 


Stefan Porubsky 


Dedicated to the memory of Professor Wolfgang Schwarz 


Abstract Alomair et al. (J Math Cryptol 4(2):121—148, 2010, Lemma 3.1) noticed 
the following result which seems not to appear previously explicitly in the literature: 
Given a nonzero a € Z,, the ring of residues modulo n, such that ged(a,n) = d|b, 
not only there exists an element x € Z, such that x-a = b (mod n), but that there 
even exists an invertible element x € Z* such that x-a = b (mod n). Their sufficient 
and necessary condition for this says that gcd(b/d,n/d) = 1 with d as above. 

A typical structure result on finite commutative semigroup says that the multi- 
plicative semigroup of Z,, decomposes into the so-called maximal subsemigroups 
belonging to the idempotents of Z,. Each such semigroup contains a maximal 
subgroup having for its identity the corresponding idempotent. In general this 
subgroup is a proper subset of the maximal subsemigroup containing it. However, 
the group of elements of Z, coprime to n is an example of the case when 
this maximal subsemigroup and the maximal subgroup coincide (both evidently 
belonging to the idempotent 1). 

In what follows we prove that if a congruence x-a = b (mod n) is solvable there 
always exists a solution in the maximal semigroup belonging to the idempotent 
given by the divisor 6 = gcd(b/d,n/d) and if 6 is a unitary divisor of n then there 
even exists a solution in the maximal subgroup belonging to the idempotent given 
by 6. 
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11A05, 20M14 
God does arithmetic. (‘O @e0f "ApiOuntiéer.) 
C.F. Gauss 


1 Starting Modifications of a Classical Result 


The following result belongs to standard requisites of elementary number theory 
(see [6, Theorem 57] and its proof): 


Lemma 1.1 /fa,b,n € Z, and gcd(a,n) = d, then the congruence! 


ax = b (mod n) (1.1) 


is solvable if and only if d\b. In the affirmative case it has just d solutions. If one 
solution is Xo, then the all incongruent ones are 


hig tO Sah (1.2) 
Xo,X0 + =X nh -—1)- : 
00% + 7X0 Fe 0 7 

Alomair et al. [1, Lemmas 3.1 and 5.5.] noticed the following surprising result 
which seems not to appear previously explicitly in the literature and which they 
applied in a construction of hash functions: 


Theorem 1.2 [fd = gcd(a,n) and d\b, then the congruence (1.1) has a solution 
coprime to n if and only if gcd (3, “) = 1, or equivalently if and only if gcd(a,n) = 
gced(b, n). 


The original proof of this simple result given by Alomair et al. [1] was 
surprisingly several pages long. A several lines proof of a slightly more general 
result was given in [5].” Here we present another proof which simplifies the original 
proof given by Alomair et al. [1]. Its idea will be used in the subsequent proof of a 
related converse question. 


Proof (Proof of Theorem 1.2) Let 


a=ad, b=bid, n=n<d. (1.3) 


'To simplify the notation and presentation all moduli and divisors will be always assumed to be 
positive in what follows. 

*In [5, Theorem 2] actually the following extension was proved: (1.1) has a solution coprime to 
n if and only if gcd(a,n) = gced(b, n). If this condition is satisfied, then there are exactly £9(5) 
incongruent solutions of (1.1) coprime to n, where 6 is the largest divisor of d with gcd(6, 4) = 1, 
and y(m) is the number of integers k, 1 < k < m, coprime to m. For another generalization consult 


[10]. 
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Then (1.1) reduces to 
a,x = b; (mod n,) with ged(a;,n,) = 1. (1.4) 


If gcd(bi, 7) > 1, then the condition gcd(a;,n,) = 1 implies that every prime 
p dividing gcd(b,, 7) necessarily divides every solution x to (1.1). To prove the 
sufficiency suppose that gcd(b;,,) = 1. Divide the primes dividing n and J, that is 
primes dividing Icm|[d, n,, b,| into the following four groups: 


— primes dividing d 


— let p denote such a prime that p|d and p + m1, but possibly p|b1, 
— let y denote such a prime that y|d and y|n;, consequently y + by, 


— primes not dividing d 


— let € denote such a prime that ¢|m,, then ¢ + b, (clearly ¢|n), 
— let r denote such a prime that r|b,, then r } n (clearly rb). 


The modulus n can be thus divisible only by primes denoted by p, y, or ¢. Since 
gcd(bi,n,) = 1 (and gcd(a;,n,) = 1), every solution x to (1.4) is coprime to n,. 
But y’s and ¢’s divide n;, and therefore xo cannot be divisible by some y or ¢. Thus 
to find a solution coprime to n it is enough to exclude only the primes p as possible 
divisors. 

All solutions of (1.1) are of the form 


Xp = x9 + kn; (mod ny), withk =0,1,...,d—1. (1.5) 


If no prime of the group p divides xo the proof is finished. In the opposite case, let 
P1,P2,...,Ps be those amongst the p’s which divide xo, and p/,,p5,...,p, which do 
not. Take in (1.5) for k a solution, say k*, to the system 


nik = 1 (mod pj), P= Nhawedy (1.6) 
nk =1—xo (mod pj), j=1,...,£. (1.7) 
Condition (1.6) assures that p; + kn, for alli = 1,...,5, and (1.7) that p; + xo + 
kn, for j = 1,...,€. Since p;--+psp---p) < d there exists a solution k* to (1.6) 


and (1.7) with 0 < k* < p,---psp-++p) — 1 < d, and the corresponding x, = 
xo + k*n, fulfills the requirements of statement to be proved. 


Since all solutions, if any, to (1.1) form an arithmetic progression (1.2), a reversed 
question can be asked with an affirmative answer: 
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Theorem 1.3 Let 
Xo tkN, k=0,1,2,..., (1.8) 


be an arithmetic sequence with gcd(xo,N) = 1. Then to every multiple M of N 
there exists a term of the sequence (1.8) such that gcd(xp + kN,M) = 1 andk € 
{0,1,...,.4—1}. 


A cursory argument based on Dirichlet theorem on primes in an arithmetic 
progression says that there is always an element (even a prime) in (1.8) which is 
coprime to M, but it may be very far from the beginning of the sequence. Famous 
Linnik’s theorem says that the least prime p(a,b) in the arithmetic progression 
a+ kb, with gcd(a,b) = 1 and 1 < a < b, where k runs through the positive 
integers, satisfies p(a, b) < cb” with effectively computable constants c and L. It is 
conjectured that p(a,b) < b?. Clearly, such a least prime in (1.8) is not necessarily 
coprime to the given M. Nevertheless, we can proof Theorem 1.3 using elementary 
means only. 


Proof (Proof of Theorem 1.3) Divide the primes dividing M = u - N into four 
groups 


1. pIN and p|# 
2. €|N and ¢ + 4 
3. y + Nand y|l4# 
4. e|(%0,M) 


Since ged(xo,N) = 1, neither p nor ¢ can divide x9 + kN for a non-negative k. 
Similarly 0|¥. 

If the set of primes y and Q is empty, then gcd(xo,M) = 1. If in the opposite 
case one of them is non-empty and if y;,..., ys are all the primes of the third type, 
and 01, Q2,...,@Q¢ all the primes of the fourth type, then any & solving the system of 
congruences 


xo +kN =1 (mod y,), i=1,...,s, ifs > 1 
kN =1 (modg,;), j=1l,...,€, if€>1 


yields the required term of the given sequence. 


In what follows we shall work with the ring structure of Z,, the ring of residue 
classes modulo n, where n € Z,n # 0. To work with its elements it will be 
sometimes more convenient to use the notation, if x € Z, then [x], will denote the 
residue class modulo n containing x. If the modulus n will be clear from the context 
we shall use the abbreviation [x]. 

As it will follow from the lines below the multiplicative semigroup (Z,,-) of 
the ring Z, contains in the general case more subgroups as it is usually taken into 
account in the standard number theory, where the group of reduced residue classes 
modulo n or maximally together with the trivial subgroup {0} are typical prototypes 
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of a subgroup of (Z,,,-). Denote by G,, (1) the multiplicative group of reduced residue 
classes modulo n. 

The following elementary result, which reproves Lemma 2.1 of [11] (see also [8, 
Lemma 2.1]*), will be used in what follows: 


Corollary 1.4 [fn,x € Z, then 
Xn = [ta]n, 


where t = gcd(n,x) and [a], € G,(1). In other words, given ann € Z, then for 
every x € Z there exists an integer a coprime to n such that x = ta (mod n), where 
t = gcd(x, n). 


Proof Write x = tb. Then Theorem 1.3 implies that in the progression > + k+ there 
is an a coprime to n. Clearly, x = ta (mod n). 


Example 1.5 Let n = 420 and x = 36 in the ring Z of rational integers. Then 
ged(420, 36) = 12. Since ($,420) # 1, 36 = 12.3 is not the representation 
of x = 36 in the spirit of the Corollary. From the solutions 3, 38, 73, 108, 143, 
178, 213, 248, 283, 318, 353, 388 to the congruence 3 = a (mod 35) incongruent 


mod 420 only 73, 143, 283, 353 are coprime to 420.* 


The divisor t = (n,x) of n will be called the divisor to which x € Z belongs 
modulo n. 

Our aim is to show that the result of Theorem 1.2 can be further extended. The 
first-stage extension is based on the fact that the subgroup Z* of integers coprime to 
n—as already indicated—is not the only subgroup of the multiplicative semigroup 
Z,, of integers modulo N worth to investigate. A further immediate generalization 
level are the commutative principal ideal rings with identity satisfying the following 
finiteness condition: 


(FN) For every nonzero ideal 3 C R the residue class ring R/9J is finite. 


This generalization can be done along the presented lines almost verbatim using the 
tools developed in [8]. A generalization to more general classes of commutative ring 
will be a subject of a forthcoming paper. 


3In [11] 8. Schwarz deals only with integers. In [8] the theory developed in [11] is extended and 
generalized to a wider class of commutative rings. The author of [11], as one of founders of the 
semigroup theory (cf. [7] or [2, 3]), tried to minimalize the usage of the addition operation in the 
proofs, while in [8] we based the reasoning on an unrestricted usage of the ring structure of the 
set of integers and its generalizations. This approach led to another symbolics of which simplified 
version will be used in this paper. For an overview of results proved by the idempotents approach 
cf. [9]. 

4This simple example also shows that the need to find coprime solutions of congruences, which 
existence is proved in Theorem 1.2, is not unusual, but nobody formulated this explicitly prior to 


[1]. 
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2 Idempotents and Group Elements 


We shall use the terminology introduced in [8]. For reader’s convenience we shall 
repeat some basic facts in this section. 

If n € Z, then the cyclic or monogenic semigroup s(x, 2)={[x]n, [x]?, [x]?, ...} 
generated by an x € Z is finite and therefore there exist the least positive integers 
k = k(x,n), and d = d(x,n) such that x’ = x't4 (mod n) for all t > k(x,n), t € Z. 
The integer k is called the index of x, and d is called the period of x modulo n. 
Consequently the cyclic semigroups generated by x have the form 


dis lence lle tlewete ll twill ceaus (2.1) 


The previous result was epitomized by Schwarz [11] as the individual Fermat— 
Euler theorem (cf. also [8, Theorem 1.9]): 


Lemma 2.1 [fx € Z, then 
Ota) = KO (mod n) 


and the numbers k(x) and d(x) are the least positive numbers with this property. 


An older theorem of Frobenius (cf. [8, Theorem 1.1]) says that the periodically 
repeating set {[x]*,..., [xJ{t4-1} in (2.1) forms a cyclic group with respect to the 
multiplication in Z,,. Its identity element, say [e],,, is an idempotent in Z,, i.e.,e = e 
(mod n). Moreover, the element [e] is the unique idempotent of Z, which belongs 
to s(x, n) what is usually phrased that the element x belongs to the idempotent ec. 
The idempotent to which a € Z belongs modulo n will be denoted by i(a,n), or 
simply i(a) if the modulus n is clear from the context. 

The set 


P,,(e) = {x € R; x belongs to e} 


is the largest subsemigroup of (Z,,-), which except for e contains no other 
idempotent of Z,, (see [8, Theorem 1.2]). By this property uniquely determined 
maximal subsemigroup P,,(e) of (Z,,-) containing e will be called the maximal 
(multiplicative) semigroup belonging to the idempotent e. 

It follows easily that Z, = (.<¢,Pn(e) where E = E, denotes the set of all 
idempotents in Z,, and that the union is disjoint. 

If [e] € £, is an idempotent, then there always exists a subgroup of (Zp, .) 
containing [e] as its identity, e.g., the group {[e]}, or the subgroups of any cyclic 
semigroups s(x, 2) with [e], as the identity in the above-mentioned Frobenius result. 
It can be shown that there always exists a (unique) maximal subgroups of (Z,,-) 
amongst all the subgroups of Z, which identity element is the given idempotent 
[e] € E,. This group G,(e) is called the maximal (multiplicative) subgroup of 
(Z,, .) belonging to the idempotent e € E,, (see [4] or [8]). Since G,,(e) C P(e) 
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Fig. 1 Hasse diagram of the divisibility structure of all divisors of 420 = 2? - 3-5-7 (unitary 
divisors are represented with black background) 


two such maximal groups are either disjoint or identical. Generally, if [a] € Z, 
belongs to a multiplicative subgroup of Z, we say that [a] or a is a group member 
modulo n. 

Note that always’ G,,(1) = P,,(1), but P,,(0) as the set of nilpotent elements of 
Zn is usually a proper overset of G,(0) = {0}. 

The set E,, can be endowed with an (partial) order® 


XxSy SQ XV=SX. 


We immediately have 
Lemma 2.2 We have i(a,n) > i(ab,n) for every a, b,n € Z. 


Let t be a divisor of n. We say that ¢ is a unitary divisor’ of n if gcd(t, 4) = 1. 
The unitary divisor d of an n will be said to be generated by tf if both f and d have 
the same set of prime divisors. The unitary divisor d of n generated by a ¢|n will be 
denoted by u(t, ), or simply u(t). 


Sef. Lemma 2.6. 


®For instance, an idempotent e € E,, is called primitive if it is minimal in the ordered set (E, \ 
{0}, <), and maximal if it is maximal in (E, \ {1}, <), but we shall not use these notions, even if 
they play an important role in [8]. 


u2 


Note that if 2 = + 1p{'p;’ -- +p” then the number of unitary divisors of n is 2” and every of them 
is of the form Tey P; for some J C {1,2,..., r}. For example, see Fig. 1. 
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Note that if e,f € E, are two idempotents of Z,,, then the relation e < f can be 
equivalently expressed in the form u(f,1)|u(e, n). 


Lemma 2.3 ([8, Theorem 2.1]) There exists a one-to-one correspondence between 
unitary divisors of n and idempotents of the residue class ring Z,. More precisely, 
every idempotent of Z, is expressible in the form [ta], where t is a unitary divisor of 
nand [a] € G,(1) is given by a solution to the congruence 


ta = 1 (mod “). (2.2) 


Iff € E, and [f]n = [ta]n, [a] € G,({1]), then we say that [f], is the idempotent 
belonging to the (unitary) divisor ¢ (of n, if the modulus is not clear from the 
context). Denote this idempotent by i(t,”), or more precisely i(u(d,n),n) where 
d\n. Note that Theorem 2.3 and the following Lemma 2.4 guarantee that there is 
no ambiguity with the previously introduced symbol i(a, n) assigned to an element 
a € R. Ina similar way we can write u(a,n) for the unitary divisor of n generated 
by the gced(a, n). 


Lemma 2.4 Letx,n € Zandt = gcd(n, x). If u(t, n) is the unitary divisor to which 
the idempotent i(x, n) belongs, then we have u(gced(x, n),n) = u(ged(i(x, n),n), n). 


Proof Let [x] = [ta] and [i(x,n)] = [db] with a,b € G,(1), be representations of x 
and i(x, n) given by Corollary 1.4. Since [x]* = [i(x, n)] for some k, the divisors t,d 
of n must have the same set of prime divisors, and the Lemma follows. 

There follows from [8, Theorem 2.4] that: 


Lemma 2.5 Let [f] € Z, be the idempotent belonging to the unitary divisor d of n. 
Then 


Gi(f) = [d]GnQ) (2.3) 
and 
Pi(f)=Gilf)U (J WG.(). (2.4) 
t\d,t<d 
u(t,n)=d 


Another important interrelation between P,,(e) and G,(e), e € E, is shown in the 
following result: 


Lemma 2.6 ([8, Corollary 1 of Theorem 1.3]) [fe € E,, then 
G,(e) = Prlee, (2.5) 


specifically for e = 1 we have G,,(1) = P,(1). 
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Moreover 


Lemma 2.7 Ife,f € E,, then 


Gnle)f = Grlef) (2.6) 
and consequently for e = 1 
Gr) = Grif). (2.7) 
Suppose that 
N= Pi'Py'**- Di, (2.8) 


is the standard form of a modulus n, n € Z. 

The next results show how it can be determined which power of an element 
[x] € Z, belongs to a (maximal) subgroup of Z,,. 

Let [x] € Z, be an element belonging to the divisor d = TTj=1 p;’ , where 0 < 
v; < uj. Then define (see [8, Theorem 2.5] or [4]) 


1, ifd = 1, 


uj . 
maxXjef1,...3| 7 |, otherwise. 
i 


v(b]) = (2.9) 


yjyF0 
Lemma 2.8 ([8, Theorem 2.5]) For every x € Z we have k(x,n) = v([x]). 
Consequently, [x]' is a group element of (Zn,-) if and only if t => v([x]). 


Corollary 2.9 If n is square-free,* then P,(e) = G,(e) for every idempotent e of 
Zn. 


For further relationships between maximal semigroups and maximal groups in 
Zn consult [8]. 


3 Idempotents and the Solvability of ax = b (mod n) 


In this part we prove some results about the solvability of congruence (1.1) in terms 
of idempotents to which parameters a, b, and n of this congruence belong. 


Theorem 3.1 /f the congruence (1.1) has a solution xo, then i(a,n) - i(xo,n) = 
i(b,n). Consequently i(a,n) = i(b,n), and i(xo,n) > i(b,n). 


8That is, uw) = +--+ =u, = 1 in (2.8). 
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Fig. 2. Hasse diagram of the divisibility structure of idempotents in Z49 (the idempotent is in the 
first row, its unitary divisor in the second one) 


Proof If there is an xo such that axy = b (mod n), let i(a) = a*, i(xo) = re 
i(b) = b". If K = lem{k, €,n}, then aXxh = b*, which implies i(a)i(xo) = i(d). 
Consequently 


i(a) -i(b) = i(a) - (i(a) - i(x0)) = ((@) - (4) - 1X0) = (a) - 1X0) = Hb) 


and 


i(D) - (xo) = (i(a) - i(%0)) + Ho) = (a) - (x0) = iC), 


and the Lemma follows. 


Unfortunately, the condition i(a) > i(b) of Theorem 3.1 is not sufficient for a 
solvability of congruence (1.1) in general: 


Example 3.2 Consider the congruence 4x = 6 (mod 420). Since ged(4, 420) = 4, 
the congruence is not solvable. On the other hand, in the ring Z429 we have i(6) = 
36, i(4) = 316, and [36] < [316] (see Fig. 2). 


In the previous Example 3.2 we have [6] € Pa20(36) \ Gazo(36) (and [4] € 
Ga (316)), i.e., [6] is not a group element. In the opposite case we have 
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Theorem 3.3 /f i(a,n) > i(b,n) and [b], is a group element,’ then the congru- 
ence (1.1) is solvable in Z. 


Proof If i(a) > i(b), then i(a) - i(b) = i(b). First we prove that!” 
P,(i(a)) - (b) C Gy(i()). 


If [x] € P,(i(a)), then [x]' = i(a) for some positive integer t. Consequently, ([x] - 
i(b))' = i(a)-i(b) = i(b) € G,,(i(b)). Moreover (Lemma 2.7) G,(i(b)) = Gn(1)i(d). 
Thus there exists a y € G,(1) such that [a] -i(b) = [y]-i(b). Then i(b) = [a]-[y]~!- 
i(b). Since b € G,,(i(b)) due to our assumption and G,,(i(b)) = P,,(i(b)) - i(b) due 
to Lemma 2.6, there exits a t € P,(i(b)) with i(b) -t = [b]. Then x = a-y!- i(b)-t 
solves (1.1). 


Example 3.4 A sufficient condition for [b] € G,(i(b)) is the equality G,(i(b)) = 
P,,(i(b)).!! If n = 420 we have G429(e) = Pazo(e) for the following idempotents: 


e € {1,21, 85, 105, 141, 225, 301, 385}. 


Here [120]420 < [400]a20 (cf. Fig. 2), P420(400) = {10, 20, 40,50, 80, 100, 110, 
130, 160, 170, 190, 200, 220, 230, 250, 260, 290, 310, 320, 340, 370, 380, 400, 410}, 
P49 (120) a {30, 60, 90, 120, 150, 180, 240, 270, 300, 330, 360, 390} and 
Gaz9(120) = {60, 120, 180, 240, 300, 360}. Clearly, all 24-6 = 144 possible 
combinations [a] € P429(400) and [b] € G429(120) lead to a solvable congruence 
ax = b (mod 420). 


On the other hand, neither the condition G,(i(b)) = Pn(i(b)) nor [b] € Gy(i(b)) 
is necessary for the solvability of congruence (1.1). 


Example 3.5 In congruence 6x = 42 (mod 420) we have i(6) = [36] > i(42) = 
[336] and P429(336) = {42,84, 126, 168, 252, 294, 336,378} 2 Ga20(336) = 
{84, 168, 252, 336}. Nevertheless, the congruence is solvable. !” 


Another additional condition completing the first condition of Theorem 3.1 and 
securing the solvability of (1.1) says 


Theorem 3.6 Let a,b,n € Z. If gcd(b, n) is a unitary divisor of n, then the condi- 
tion i(a,n) = i(b,n) is also sufficient for the solvability of the congruence (1.1). 


Proof In Z, condition i(a) > i(b) means that 


Iem(u(b, n), u(a,n)) = u(b,n). 


°That is, [b] € G,,(i(b)). 

'0Note that Lemma 2.6 shows that P,,(i(b)) - i(b) = G,, (i(b)). 

' This condition is satisfied for every idempotent if n is square-free as it follows from Corollary 2.9. 
‘Having, by the way, the same set of solutions as the congruence in Example 3.8. 
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In other words, every prime divisor of a dividing n also divides b. Since u(b,n) 
is a unitary divisor of n, every prime power dividing gcd(a,n) divides also b, 
that is gcd(a,n)|b, what is the necessary and sufficient condition for solvability of 
congruence (1.1). 


Lemma 2.5 implies that every element of G,,(i(b)) is divisible by the unitary 
divisor u(b,n) and (2.4) shows that these are the only elements in P,,(i(b)) (and 
consequently in the whole Z,,) possessing this property. Thus Theorem 3.6 is also a 
consequence to Theorem 3.3. 


Theorem 3.7 [f xo is a solution to (1.1), d = ged(a,n) and 6 = ged (4. ), then 


Z 
i(b,n) < i(xo,n) < i(6,n). (3.1) 


Proof The left-hand side inequality follows from Theorem 3.1. To prove the right- 
hand side, let, as in proof of Theorem 1.2, write a = ajd, b = bid, andn = njd. 
Then (1.1) reduces to (1.4). If a prime p divides 6 = gced(b, 11), then p|m and p + 
a. Consequently, p|xo and thus every prime dividing u(6, 7) also divides u(x, 1), 
that is u(6,1)|u(xo, 2), or i(xo,n) < i(6,n). 


Example 3.8 Consider the congruence 
18x = 126 (mod 420). (3.2) 


Here ged(18, 420) = 6 = 2-3, gcd(126, 420) = 42 = 2-3-7, i(18, 420) = [36], 
i(126, 420) = [336], [18] € P420(36) \ Gaz0(36), and [126] € P429 (336) \ Gaz9(336). 
Further § = ged (48, 4°) = ged(21, 70) = 7, i(7, 420) = [301]. 

The set of all incongruent solutions mod 420 is 


7,77, 147, 217, 287, 357. 
They belong to the following idempotents mod 420 
21,301. 


The distribution of solutions in groups and semigroups belonging to corresponding 
idempotents is tabbed in Table 1. 


Example 3.8 also shows that a fulfillment of the chain of inequalities (3.1) is 
not sufficient for an xo to be a solution. In this example 6 = 7, i(7,420) = [301] 


Table 1 Distribution of e Ga20(e) | Papo (e) 
solutions to 6x = 42 1 7 i, 
(mod 420) (21) | 147, 357 —— |S 
[301] | 7,77, 217, 287 |G 
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and i(42, 336) = [336]. However, there is no solution belonging to the idempotent 
[196] = i(28, 420) lying between [301] and [336]. 
The following result gives our first possible generalization of Theorem 1.2: 


Theorem 3.9 Let d = gcd(a,n) and alb. Let 5 = ged (3, “). Then a congru- 
ence (1.1) has a solution xo belonging to P,,(i(6,n)). 


Proof Suppose that besides (1.3) we also have 
bi = bbo, ny = dno. (3.3) 
Then given a solution xo to (1.1) we can also write xp = 5x2 and (1.1) reduces to 
\X2 = by (mod nz). (3.4) 


Since gced(b2,n2) = 1, all solutions to congruence (3.4) are of the form x, + kn 
with a suitable x), satisfying gcd(x,,n2) = 1. Then ay - (6x4) = dbo (mod dnp), 
and also ajd - (6x) = db, (mod dn;). In other words, 5x4, solves the original 
congruence (1.1). 

Theorem 1.3 implies that there is an x” = x, (mod ng) such that ged", n)) = 1. 
Thus 5x” solves (1.1) and simultaneously 6x” € P,,(i(6,n)) according to Corol- 
lary 1.4 and Lemma 2.5. 


Note that Example 3.8 shows that a congruence (1.1) satisfying conditions of 
Theorem 3.9 can have further solutions not belonging to P,,(i(6,7)). 

In the case of coprime solutions handled in Theorem 1.2 we have 6 = 1. In this 
case we have G,(1) = P,(1) for every n, that is every solution to (1.1) which is 
coprime to its modulus n is immediately a group element. However in the case of 
a general idempotent e € E, we have, as already noticed several times, G,(e) & 
P,,(e). Therefore not every solution to (1.1) belonging to P,,(i(6, m)) must be a group 
element. Nevertheless, our previous results imply the following second and possibly 
more close generalization of Theorem 1.2: 


Corollary 3.10 Let d = gcd(a,n) and d|b. Let 5 = gcd (3, “). Then a congru- 
ence (1.1) has a solution xo which is a group element! if and only if 8 is a unitary 
divisor of n. 


Proof If 6 is a unitary divisor of n, then every solution to (1.1) is divisible by 6 and 
Theorem 2.5 finishes the proof. 


Note that if n is square-free then Corollary 2.9 implies that G,(e) = P,(e) for 
every idempotent e € Z,, and consequently: 


Corollary 3.11 Let n be square-free and gcd(a,n)|b. Then every solution to a 
congruence (1.1) is a group element. 


Example 3.12 Letn = 210 = 2-3-5-7. The Hasse diagrams of its divisors and 
idempotents in Z>19 are given in Fig. 3. 


'3More precisely belonging to the group G,,(i(5, n)). 
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Fig. 3. Hasse diagrams of the divisors (top) and the idempotents in case of modulus n = 210 
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Consider the congruence 
18x = 84 (mod 210). (3.5) 


Here gced(18,210) = 6 = 2- 3, gcd(84,210) = 42 = 2-3-7, i(18,210) = 
[36], i(184, 210) = [126], [18] € P210(36) = Goi0(36), and [126] € P219(336) 
Go19(336). Further 6 = ged (#, 7°) = ged(14, 35) = 7, i(7, 210) = [91]. 

The set of all incongruent solutions mod 210 is 


II 


28, 63, 98, 133, 168, 203. 
They belong to the following idempotents mod 420 
21,91, 126, 196. 
The distribution of solutions in groups and semigroups belonging to corresponding 


idempotents is tabbed in Table 2. 


It is easy to find a solvable congruence (1.1) with no solution coprime to n. 
Example 3.8 shows that it is possible that a congruence (1.1) has no solution coprime 
to its modules, but has solutions being group elements. The following example 
shows a case of a congruence (1.1) having only solutions being group elements 
including coprime ones and of type not covered by Corollary 3.11: 


Example 3.13 Consider the congruence 
231x = 63 (mod 420). (3.6) 
Here 231 = 3-7-11, 63 = 3-7, gcd(231, 420) = gcd(63, 420) = 21 = 3-7, and 


i(231, 420) = i(63, 420) = [21] and {[63], [231]} € Gago(21) = P4y9(21). Further 


8 = ged(¥, 2°) = ged(3, 20) = 1, ie. (1, 420) = [I]. 


The set of all incongruent solutions mod 420 is 
13, 33, 53, 73, 93, 113, 133, 153, 173, 193, 213, 233, 253, 273, 293, 313, 333, 353, 373, 393, 413. 
They belong to the following idempotents mod 420 


1,21, 141, 301. 


Table 2 Distribution of 7 


e Gyao(e = Pyo(e 
solutions to 18x = 84 1 G s20(¢) s20(e) 
(mod 210) [21]_|63 

[91] __| 133, 203 

[126] | 168 

[ 


196] | 28, 98 
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Table 3 Distribution of solutions to 231x = 63 (mod 420) 


G42 (e) 


The distribution of solutions in groups and semigroups belonging to corresponding 
idempotents is tabbed in Table 3. 


The following example shows that there are congruences with no solution being 
a group element at all: 


Example 3.14 Consider the congruence 
231x = 42 (mod 420). (3.7) 


Here 231 = 3-7- 11,42 = 2-3-7, gced(231, 420) = 21, ged(42, 420) = 42, and 
i(231, 420) = [21], (42,420) = [336] while [231] € Gaz9(21) = P420(21), [42] € 
P439(336) \ G4zo(336). Further 6 = ged (#2, 3°) = gced(2,20) = 2, i(2,420) = 
[316]. 

The set of all incongruent solutions mod 420 is 


2,22, 42, 62, 82, 102, 122, 142, 162, 182, 202, 222, 242, 262, 282, 302, 322, 342, 362, 382, 402. 
They belong to the following idempotents mod 420 
36, 196, 316, 336. 


The distribution of solutions in groups and semigroups belonging to corresponding 
idempotents is tabbed in Table 4. 


The final example shows that it possible to have both group element solutions 
and non-group element solutions: 


Example 3.15 Consider the congruence 


6x = 36 (mod 420). (3.8) 
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Table 5 Distribution of e Gazo(e) | Paz0(e) 
solutions to 6x = 36 36 216 6 
(mod 420) 


[316] | 76,356 | 146,286 


Here 6 = 2-3, 36 = 2? - 37, gcd(6, 420) = 6, gcd(36,420) = 12 = 2? - 3, 
and i(6,420) = i(36,420) = [36], and clearly [36] € Gu0(36). Further 6 = 
gcd (#8, 2°) = gcd(6, 70) = 2, i(2, 420) = [316]. 

The set of all incongruent solutions mod 420 is 


6, 76, 146, 216, 286, 356. 
They belong to the following idempotents mod 420 
36, 316. 


The distribution of solutions in groups and semigroups belonging to corresponding 
idempotents is tabbed in Table 5. 


4 Wolfgang Schwarz, Me, and the Role of Idempotents 


The mathematical content of this paper is actually a collection of corollaries to a part 
of work done during my stay at the J.W.Goethe Universitat in Frankfurt am Main 
as a fellow of the Alexander von Humboldt (AvH) Foundation in 1980/81. The part 
of my research devoted to properties of idempotents later served as the basis of 
my joint paper [8] with my Ph.D. student and collaborator M.La88ak. I had spent 
12 months in Frankfurt starting in October 1980 under supervision of Professor 
Wolfgang Schwarz. I have a vivid memory of the excellent mathematical library on 
Robert—Mayer-Strasse to which I was given access immediately after my arrival at 
the railway station in Frankfurt. I received the library key even earlier than the key to 
the flat in Beethovenstrasse where I lived for the duration of my stay. Getting a key 
to the library was a big surprise to me because at that time it was impossible for me 
and my coworkers to enter my home institute library in the absence of the librarian, 
and here it was a common thing even also for me from an “enemy country” (which 
was an official political jargon for countries behind the Iron Curtain). I spent a lot 
of time and many weekends in this library. Motivated by the paper [11], I conducted 
a systematic literature research in this library on the history of the idempotents and 
their role in the ring theory, including the crucial paper [4] on the group elements. 
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This significantly expanded my previous knowledge on the role of idempotents in 
the semigroup theory advocated by Professor Stefan Schwarz" in his paper [11]. 

I met Prof. Wolfgang Schwarz for the first time during a number theory 
conference in Debrecen (Hungary) in 1974. Later he encouraged me to apply for 
the above-mentioned fellowship from the AvH Foundation. My stay in Frankfurt 
gave me an impetus and motivation for various non-mathematical activities. For 
instance, after my return I convinced the other Schwarz, Professor Stefan Schwarz, 
to allow us to visit the library in my institute also at a time when no librarian was 
present. 

One of the activities of Professor Wolfgang Schwarz which I admired greatly 
was his service as a “Vertrauensdozent der Frankfurter Hochschulgruppe” of the 
Cusanuswerk,'° a position he was very proud of. As a part of this service he invited 
two to three dozen students of various specializations to his home in Ruppertshain 
twice a year. There he organized an evening party that included a colloquial lecture 
on a topic of general interest for these students. These evenings were one of the 
most pleasant experiences that left an indelible imprint on my stay in Frankfurt. 
Last but not least, it was Professor W. Schwarz and his wife Doris who invited me 
and my whole family to visit them immediately after the collapse of the communism 
in Czechoslovakia. It was our first joint family visit to a western country. Previously 
I was able to travel with my wife only under the condition that our children stayed 
at home. Later we had visited them on number of occasions. To this day, all of my 
family treasure the hospitality we had enjoyed. Lieber Herr Schwarz, Ehre Ihrem 
Andenken! 
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experiences with the interplay between number theory and graph theory. To start 
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1 Introduction and Some Highlights in History 


Besides geometry, number theory is considered to be one of the founding fields 
of their science by almost all mathematicians.” As against number theory, graph 
theory — at least considered as a mathematical discipline — is undoubtedly young. 
Although many textbooks emphasize Euler’s solution to the famous problem of 
the seven bridges of Kénigsberg* from 1736 to be the first graph-theoretical paper, 
graphs as a mathematical concept did not make an explicit appearance in it.* In fact, 
Euler himself denied to consider his proof as an application of ‘true’ mathematics. 

As usual it took quite some time before the formal mathematical notion of a 
graph was established. It was Sylvester who introduced the term graph in 1878 in 
the context of chemistry.° The birth of graph theory as a scientific discipline in its 
own right was marked by the treatize Die Theorie der reguliren graphs® published 
by Petersen in 1891. In the years that followed the new field grew rapidly. The first 
and for many years only textbook on graph theory’ was published by Denés K6nig 
in 1936. A lot of fundamental results originated from the 1930s. 

Even at this early stage the first interrelations between number theory and graph 
theory occurred. An important example is what is now called Ramsey theory, dealing 
with the following question: 


Assume that a set S has a certain property. Under which assumptions are we able 
to guarantee that, after partitioning S, at least one of the subsets of S still has that 
property? 


If it holds, this concept may be viewed as a generalization of the box principle 
(or pidgeon hole principle): after partitioning a set S with |S| > 2 into less than 
|S| subsets, there is at least one subset with again > 2 elements. In 1930 Ramsey 
published an article formulated in terms of formal logic.® Nowadays the classical 
results of Ramsey are usually formulated in the language of graph theory. 


?For a very readable discussion on the formation of number theory as a discipline we recommend 
the essay A book in search of a discipline by C. Goldstein and N. Schappacher in: The Shaping 
of Arithmetic after C.F) Gauss’s Disquisitiones Arithmeticae, C. Goldstein, N. Schappacher, J. 
Schwermer (eds.), Springer, 2007. 

3L. Euler, Solutio problematis ad geometriam situs pertinentis, Commentarii Academiae Scien- 
tiarum Imperialis Petropolitanae 8 (1736), 128-140 = Opera Omnia (1) 7 (1911-56), 1-10. 

4W. Hopkins, The Truth about Kénigsberg, The College Mathematics Journal 35 (2004), 198-207. 
SUI. Sylvester, Chemistry and Algebra, Nature 17 (1878), 284-284. 

6}. Petersen, Die Theorie der reguldren Graphs, Acta Mathematica 15 (1891), 193-220. 

7D. K6nig, Theorie der endlichen und unendlichen Graphen. Kombinatorische Topologie der 
Streckenkomplexe, Math. u. ihre Anwendung in Monogr. u. Lehrbiichern 16, Akad. Verlagsge- 
sellschaft, Leipzig, 1936. 

SEP. Ramsey, On a problem of formal logic, Proceedings of the London Mathematical Society 
30(1) (1930), 264-286. 
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Theorem 1.1 (Ramsey’s Theorem, Two-Colour-Version) Given two complete 
graphs K, and K; (r,b = 2), one can find a complete graph having the property that 
with respect to any two-colouring of its edges (e.g. with red and blue) it contains a 
red K, or a blue Kp. 


As a special case, we have the simple fact that every graph G = (V, E) with at least 
six vertices contains a complete triangle, i.e. a three-clique, or the complementary 
graph G = (V,E) (where E consists precisely of those edges between vertices in V 
which are missing in £) contains a complete triangle, i.e. G contains an independent 
set of 3 vertices. 

Ramsey’s theorem was not the first one of its kind. As early as 1927 van der 
Waerden proved the following result in number theory’: 


Theorem 1.2 For any positive integer r there exists a positive integer n and an 
arithmetic progression P of length r lying in the set {1,...,n} such that any function 
f:{1,2,...,n} > {1,2,...,r} restricted to P is constant. 


Now let us reformulate van der Waerden’s theorem 


— in terms of colours: For each positive integer r there is ann such that {1,...,n} 
contains a monochromatic arithmetic progression of length r for any r-colouring. 

— in terms of partitions (cf. initial characterization of Ramsey theory): For each 
positive integer r there is ann such that at least one subset of any r-partition of 
{1,...,} contains an arithmetic progression of length r. 


Again and again graph theory and number theory have applied tools from the 
other discipline to solve problems of their own. A famous example is the following 
generalization of van der Waerden’s theorem, conjectured by Erdés und Turan in 
1936": 


Conjecture 1.3 Each set of positive integers having positive natural density con- 
tains an arithmetic progression of length k for any k. 


This conjecture was proven in 1975 and has been known as Szemerédi’s theorem 
ever since. Szemerédi’s celebrated proof of this number-theoretic (looking?) asser- 
tion used combinatorial and, in particular, graph-theoretic tools.'' For k = 3, the 
result was established in 1953 by Klaus Roth! by adapting the Hardy—Littlewood 
circle method. Meanwhile, further completely different proofs are known. One of 


°B.L. van der Waerden, Beweis einer Baudetschen Vermutung, Nieuw Arch. Wisk. 15 (1927), 212— 
216. 

0p Erdés and P. Turan, On some sequences of integers, Journal of the London Mathematical 
Society 11(4) (1936), 261-264. 

MIE. Szemerédi, On sets of integers containing no k elements in arithmetic progression, Acta 
Artithmetica 27 (1975), 199-245. 

2K. Roth, On certain sets of integers, Journal of the London Mathematical Society 28(1) (1953), 
104-109. 
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them, using ergodic theory, was published by Furstenberg,'’ another more recent 
one by Gowers! applies tools from both Fourier analysis and combinatorics. 
Terence Tao! coined the phrase of a Rosetta stone to illustrate how various proofs of 
Szemerédi’s theorem connect different fields of mathematics. Green and Tao were 
the ones who showed that Szemerédi’s theorem is not the end of the story. They 
proved'® that even the set of prime integers, although having density zero, contains 
arithmetic progressions of arbitrary length, the reason being that the sequence of 
primes behaves ‘random enough’ besides the fact that is it ‘not too thin’. 


2 Divisibility and Coprimality 


Erdés himself pointed out repeatedly how helpful graph theory could be for certain 
problems in number theory.!? Quite often graphs can be used to model correlations 
between integers.!* Classical examples are divisor graphs and coprime graphs (also 
called relatively prime graphs). 

The vertices of a divisor graph are given by the elements of any set of positive 
integers. Two such integers m and n, say, are considered adjacent if and only if 
one of them divides the other. Of course, one could refine the model by using a 
directed graph to distinguish between m | n and n | m. Anyway, the resulting graph 
models the divisibility relation in the given set. As an example of the benefit of 
this perspective, Pomerance!® gained surprising structural insights by investigating 
longest paths in divisor graphs. Chartrand et al. asked: Which graphs do occur as 
divisor graphs? More precisely, they investigated for which graphs the vertices can 
be numbered by different positive integers such that the divisor graph of this set of 
integers is the given one. Chartrand et al.” proved among other things: 


13H. Furstenberg, Ergodic behavior of diagonal measures and a theorem of Szemerédi on 
arithmetic progressions, Journal d’ Analyse Math. 31 (1977), 204-256. 

47. Gowers A new proof of Szemerédi’s theorem, Geom. Funct. Anal. 11(3) (2001), 465-588. 

ST. Tao, The dichotomy between structure and randomness, arithmetic progressions, and the 
primes, pp. 581-608 in: M. Sanz-Solé, J. Soria, J.L. Varona et al., International Congress of 
Mathematicians Ziirich, European Mathematical Society, Ziirich, 2007. 

16B. Green and T. Tao, The primes contain arbitrarily long arithmetic progressions, Annals of 
Mathematics 167(2) (2008), 481-547. 

"E.g. P. Erdés, Some applications of graph theory to number theory, pp. 77-82 in: The Many 
Facets of Graph Theory, Proc. Conf. Western Mich. Univ., Kalamazoo, Mich. 1968, Springer 
LNM, Berlin, 1969. 

'8Cf. chapter Graphs on the integers, in: R.L. Graham, M. Grétschel, L. Lovdsz, Handbook of 
Combinatorics, Vol. 1, North-Holland, New York, 1995. 

°C, Pomerance, On the longest simple path in the divisor graph, Congressus Numerantium 40 
(1983), 291-304. 

20G. Chartrand, R. Muntean, V. Saenpholphat, and P. Zhang, Which graphs are divisor graphs?, 
Congressus Numerantium 151 (2001), 189-200. 


Recent Developments on the Edge Between Number Theory and Graph Theory 409 


. Every complete graph is a divisor graph. 

. Every tree and, more generally, every bipartite graph is a divisor graph. 

. Every induced subgraph of a divisor graph is again a divisor graph. 

. Divisor graphs cannot contain induced circles of odd length greater than 3 (i.e. at 
most triangles). 

5. A graph without triangles is a divisor graph if and only if it is bipartite. 


BwWN Fe 


Finally, they gave this elegant characterization: 


Theorem 2.1 A graph is a divisor graph if and only if there exists an orientation 
of its edges such that every non-isolated vertex v is either source (i.e. v has only 
outgoing edges) or sink (i.e. v has only incoming edges), or, if v has incoming 
and outgoing edges, then each incoming neighbour”! of v has edges to each of 
the outgoing neighbours of v. 


One of the most spectacular results in graph theory was the proof of an old 
conjecture of Berge, namely the strong perfect graph theorem.”* By using this 
powerful tool, Al-Addassi et al. showed in 2012 that all divisor graphs are perfect,”* 
i.e. the chromatic number of every induced subgraph equals the size of the largest 
clique of that subgraph. The strong perfect graph theorem implies a weaker 
statement** saying that a graph is perfect if and only if its complement is perfect 
as well. 

As a second example, we shall look at coprime graphs. Given a set S of positive 
integers, the corresponding coprime graph Copr(S) is defined to have vertex set S$ 
and edges between m,n € S if gcd(m,n) = 1. It is easy to show that every graph 
is a coprime graph Copr(S) for a suitable set 57°. At this point it is natural to shift 
the focus from the question which graphs are coprime graphs to the correspondence 
between arithmetical features of S and certain graph properties. For example, Erdés 
und Sarkézy looked for circles in coprime graphs” and the existence of complete 
tripartite subgraphs.”° 

In the literature, one can also find a more restrictive notion of coprime graphs: a 
graph having n vertices is called a coprime graph if its vertices may be numbered by 
the integers 1,..., in such a way that all pairs of adjacent vertices are coprime (but 


2! Neighbour is synonymously used for adjacent vertex. 

22M. Chudnovsky, N. Robertson, P. Seymour, and R. Thomas, The strong perfect graph theorem, 
Annals of Mathematics 164(1) (2006), 51-229. 

238, Al-Addasi, O.A. AbuGhneim, and H. Al-Ezeh, Further new properties of divisor graphs, J. 
Comb. Math. Comb. Comput. 81 (2012), 261-272. 

4 As also conjectured by Berge and shown by L. Lovdsz, (1972), A characterization of perfect 
graphs, Journal of Combinatorial Theory, Series B 13(2) (1972), 95-98. 

°5P. Erdés and G.N. Sarkézy, On cycles in the coprime graph of integers, Electron. J. Comb. 4(2) 
(1997), Research paper R8. 

26G.N. Sarkézy, Complete tripartite subgraphs in the coprime graph of integers, Discrete 
Mathematics 202 (1999), 227-238. 
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possibly others as well).?” With regard to this stronger definition of coprimality, it 
is no longer obvious which graphs have that property at all. A famous conjecture 
made by Entringer und Tout in the 1980s says that every tree, i.e. every graph 
containing no cycles, is a ‘strong’ coprime graph. For small trees this conjecture 
has been verified by inspection. More importantly it was shown recently that the 
conjecture holds for all sufficiently large trees.”* 

Even more specific is the so-called maximal coprime graph Copr(n) := 
Copr({1,2,...,7}), defined for every positive integer n (where a,b € {1,...,n} 
are adjacent if and only if gcd(a, b) = 1. Quite a few structural features of these 


graphs are known”’: 


— Copr(n — 1) is an induced subgraph of Copr(n). 

— Each spanning subgraph of Copr(n) is a maximal coprime graph itself. 

— Each graph containing a graph which is not maximal coprime has this property 
as well. 


3 Integrality: A First Taste 


The integers are the fundamental object of study for number theorists. There is 
no obvious analogous concept of integrality for graphs. But whenever some graph 
property is measured with a number set containing integers it is natural to ask for 
which graphs the property assumes integral values. A prominent case is the spectrum 
of the adjacency matrix. For a finite graph G = (V, E) with |V| = {v,..., vn}, say, 
the adjacency matrix A(G) = (a,) is defined as the n x n matrix with entries aj = 1 
if v; and v; are adjacent, and aj = 0 otherwise. Observe that for an undirected 
graph G the matrix A(G) is a symmetric 0-1-matrix. Linear algebra tells us that all 
eigenvalues of A(G) are real numbers. Moreover, since every rational eigenvalue 
of an integer matrix is an integer, any graph eigenvalue is either an integer or an 
irrational number. The research revolving around eigenvalues and eigenspaces of 
graphs forms the field of spectral graph theory. 

A graph G is called an integral graph if all eigenvalues of G, which means all 
eigenvalues of A(G), are integers. In other words, integral graphs have an integral 
spectrum. The problem to characterize all integral graphs was first raised in 1974 by 


27This procedure is named coprime labelling or, rather unsuitably, prime labelling. 
28P Haxell, O. Pikhurko, and A. Taraz, Primality of trees, J. Comb. 2(4) (2011), 481-500. 


°See the survey article by S.N. Rao, A Creative Review on Coprime (Prime) Graphs, Lecture 
Notes, DST Work Shop (23-28 May 2011), WGTA, BHU, Varanasi, 2011. 
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Harary and Schwenk.*° Since then many partial answers to this apparently difficult 
question have been published.*! 

Before considering integrality itself, let us consider an example that demonstrates 
how even the question whether a graph exhibits a certain integer in its spectrum leads 
to interesting insights. To this end, let us take a closer look at the spectrum of the 
maximal coprime graph Copr(n). While numerical experiments quickly reveal that 
Copr(n) might not be integral, they also suggest that the integer 0 always belongs 
to the spectrum. Moreover, one is lead to conjecture that the multiplicity of 0 in the 
spectrum increases monotonously as n grows. Since gcd(1, 1) = 1, each coprime 
graph has a loop at 1. This we agree to remove in order to have a loopless coprime 
graph (this does not influence essentially what follows). The authors*” proved the 
following: 


Theorem 3.1 For any integer n > 1, the geometric multiplicity y(n) of the 
eigenvalue 0 of the loopless coprime graph Copr(n) satisfies 


v(n if M(n 1, 
yoy =i) EMO) A 
vin) +1 ifM(n) = 1, 
where |1(n) denotes the Moebius function, v(n) := n— }),<, |“(k)|, and M(n) := 
re LK) is Mertens’ function. Consequently, 


y(n) = (1- <)n + OVA). 


Moreover, it turned out that in case M(n) 4 1 the kernel of the loopless Copr(n) has 
a so-called simple basis containing vectors with no other entries than 0, 1, —1. This 
is wrong if M(n) = 1. In comparison, the coprime graphs with loop at | always 
have a simple kernel basis. 


4 Graph and Groups: Even More Integrality 


Another class of graphs related to algebra and, in particular, to number theory are 
Cayley graphs. Generally, these graphs model algebraic relations between elements 
of groups (or rings). Given a (finite) group H, the Cayley graph Cay(H, S) has vertex 
set H, and we need a so-called set S C H of symbols. Assuming H to be an additive 


39 Harary and A.J. Schwenk, Which graphs have integral spectra?, pp. 45-51 in: Graphs and 
Combinatorics, Springer, Berlin, 1974. 


31K. Balinska, D. Cvetkovic, Z. Radosavljevic, S. Simic, and D. Stevanovic, A survey on integral 
graphs. Publ. Elektroteh. Fak., Univ. Beogr., Ser. Mat. 13 (2002), 42-65. 


32 


J.W. Sander and T. Sander, On the kernel of the coprime graph of integers, Integers 9 (2009), 
569-579. 
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Fig. 1 Cay(Z10, Zf{p) = Cay(Zio, {1, 3, 7, 9}) = UCay(10) = ICG(10, {1}) 


group (H, +), S is required to satisfy 0 ¢ S and S = —S := {-s: s € S} (for 
multiplicative groups (H,-), we require 1 ¢ Sand S = S! := {s!: s € S$} 
correspondingly). Two vertices x,y € H are defined to be adjacent if x —y € S$ 
(or xy_! € S, respectively). By the conditions on S, Cayley graphs are loopless and 
undirected (Fig. 1). 

The research on Cayley graphs has been extensive and diverse.** One of the most 
basic structural problems asks for isomorphic Cayley graphs, where two graphs 
G = (V,E) und G’ = (V’, E’) in general are called isomorphic if there is a bijection 
yg: V — V' satisfying (x,y) € E ©} (g(x), 9()y)) € E’. Trivially, each Cayley 
graph Cay(H, S) is isomorphic to Cay(H, aS) by way of the translation g(x) = ax. 
More generally, every group automorphism o on H yields an isomorphism between 
Cay(H,S) und Cay(H,o(S)). The inverse problem to determine if there is an 
isomorphism between two given graphs is much harder. Even the task to find explicit 
criteria for isomorphy between graphs in very simple subclasses of Cayley graphs 
took decades.** 


4.1 Circulant Graphs 


As an example closer related to number theory, let us consider the class of circulant 
graphs. These are the Cayley graphs over cyclic groups (i.e. over some additive 


33 G, Hahn and G. Sabidussi, Graph symmetry: algebraic methods and applications, Proceedings 
of the NATO Advanced Study Institute and séminaire de mathématiques supérieures, Montréal, 
Canada, July 1-12, 1996; Springer, New York, 1997. 

34C.H. Li, On isomorphisms of finite Cayley graphs—a survey, Discrete Mathematics 256(1—2) 
(2002), 301-334. 
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group (Z,, +) of residue classes mod n; cf. Fig. 1). Their adjacency matrices are 
circulant matrices, which have been studied for quite some time.*> A first important 
result on the isomorphy between circulant graphs was published in 1967, however, 
only in case the graphs have prime order, i.e. the number of vertices is a prime 
number™: 


Theorem 4.1 Let p be prime. Then Cay(Z,,S) and Cay(Z,, S’) are isomorphic if 
and only if there exists a unit a € zy such that S’ = aS. 


This shows that among circulant graphs of prime order only the trivial isomorphisms 
by translation exist. In other words, checking two such graphs for isomorphy can be 
reduced to comparing the two sets of symbols. Unfortunately, this easy reduction 
does not work for circulant graphs over Z, with arbitrary n°’: 


Theorem 4.2 [f Cay(Z,,,S) and Cay(Z,,S’) are isomorphic, then there exists an 
isomorphism by translation if and only ifn € {8, 9, 18} orn = 2°mwithe € {0, 1,2} 
and m odd and squarefree. 


Tremendous efforts were made to find a manageable criterion for isomorphy 
between circulant graphs in general (in analogy to Theorem 4.1), which allows to 
characterize isomorphic circulant graphs by explicit comparison of their symbol 
sets. This was finally proven at the turn of the millennium, and it became apparent 
that the comparison of the two symbol sets has to be performed in ‘layers’. The 
d-th layer (S)q of a symbol S in a group of order n is defined by (S)a = {s € 
S: gced(s,n) = d}, and we have 


Theorem 4.3 Cay(Z,, S) and Cay(Z,, S’) are isomorphic if and only if for each d|n 
there exists an integer mq € Z* such that ma(S)a = (S’)a. 


This result had been conjectured in 1975 by D.K. Zibin, and it was confirmed 
in 2001/2002 independently by two research teams.** The criterion reveals that 
isomorphy is related to certain divisibility properties of the elements of the symbol 
sets. 

In a similar fashion, another arithmetic criterion exists to characterize connec- 
tivity of Cayley graphs. In fact, a general Cayley graph Cay(H, S) is connected if 
and only if the symbol set S generates the group H. Consequently, a circulant graph 
Cay(Z,, S) with S = {s,,...,5,} is connected if and only if gcd(n, 51,..., 5%) = 1. 
More precisely, such a circulant graph satisfying gcd(n, s1,..., 5%) = d decomposes 


35Ph.J. Davis, Circulant matrices, Pure and applied Mathematics. Wiley-Interscience Publication, 
New York, 1979; 2nd ed., AMS Chelsea Publishing, 1994. 

36J. Turner, Point-Symmetric Graphs with a Prime Number of Points, Journal Comb. Theory 3 
(1967), 136-145. 

37This is a result of Muzychuk, to be found in Hahn/Sabidussi.** 

38 M. Muzychuk, M. Klin, and R. Péschel, The isomorphism problem for circulant graphs via 
Schur ring theory, (DIMACS workshop 1999), Ser. Discrete Math. Theor. Comput. Sci. 56 (2001), 
241-264; and E. Dobson and J. Morris, Toida’s conjecture is true, Electron. J. Comb. 9(1) (2002), 
Research paper R35. 
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into exactly d isomorphic graph components Cay(Zy/¢,S). For Cayley graphs in 
general the components are Cay((S) , S). 

Circulants form an important class of graphs having many applications in 
different fields, e.g. in redundant network modelling,*? where the simplest cases are 
powers C of cycle graphs C, with symbol set {—k, —k+1,...,—-1,1,...,k—1, k}. 
Circulant graphs are also related to physics, since they model the behaviour of 
quantum systems. Such a system turns out to be periodic if and only if the 
corresponding graph is integral.*” 

The eigenvalues of circulant graphs, i.e. the eigenvalues of circulant matrices, 
can be computed by a handy formula (cf. Davis*>). Setting w, = exp(2iz/n), the 
eigenvalues A; of Cay(Z,, S) are given by 


— Js —_ = 

Aj = ) of, j=O0,...,.n-1. (4.1) 
ses 

Then v; := (@°,a},...,@"~!)" is a complex-valued eigenvector for A;, and the 

vectors Up,..., Un—1 are a basis of R”. Two questions come to mind: In which cases 


is the circulant graph, i.e. its spectrum, integral? What about real eigenvectors? 

To answer the first question, the layers of the symbol set S' (see Theorem 4.3) 
play the decisive role: Cay(Z,,, S) is integral if and only if S is made up of complete 
layers. If we define G,(d) = {s € Z* : gcd(s,n) = d} we have": 


Theorem 4.4 A circulant graph Cay(Z,, S) is integral if and only if there exists a 
set D of divisors of n such that S = |) G,(d). 
deD 


Consequently, integral circulant graphs are characterized by simply stating n and 
D. Therefore one may define the graph ICG(n, D) as the graph Cay(Z,,, S) with 


S=(JG,@) = {s € ZF : ged(s,n) € D}. (4.2) 
deéD 


A special subclass of circulant graphs are the unitary Cayley graphs UCay(n) := 
Cay(Z,,G,(1)) (Fig. 1). The name is derived from the fact that G,(1) = Z* is 
the group of units in the ring Z,. Using the preceding theorem and some easy 
number-theoretic arguments, the integrality of unitary Cayley graphs can be shown 
as follows. According to the special structure of its symbol set, the eigenvalues of 
UCay(n) are 


A, = > wi = c(r,n) for O<r<n-1. 


l<j<n 
ged(j,n)=1 


39C.K. Wong and D. Coppersmith, A combinatorial problem related to multimodule memory 
organizations, J. Assoc. Comput. Mach. 21 (1974), 392-402. 


40N. Saxena, S. Severini, and I.E. Shparlinski, Parameters of integral circulant graphs and periodic 
quantum dynamics, Int. J. Quantum Inf. 5(3) (2007), 417-430. 


41W. So, Integral circulant graphs, Discrete Mathematics 306 (2005), 153-158. 
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Here c(r,n) is a Ramanujan sum, which is known to be integer-valued for all 
r,n > OQ. Thus unitary Cayley graphs are integral, and their eigenvalues can be 
computed by*” 


A, = c(r,n) = neyo. where f, 


p(t,) 


j4 again denotes the Mdébius function, and g is Euler’s totient function. It is an 
immediate consequence of this formula that each eigenvalue A # 0 of UCay() is a 
divisor of y(n). 

With regard to the characterization of the eigenvalues of unitary Cayley graphs 
in terms of Ramanujan sums it is not difficult to see that, more generally, every 
graph Cay(Z,, G,(d)) is indeed integral. Uniting the symbol sets of such graphs 
yields a partition of the sum in (4.1) accordingly. This implies that the eigenvalues 
of each such graph are sums of the eigenvalues of the graphs Cay(Z,, G,,(d)). By 
this argument we have obtained one direction of the assertion in Theorem 4.4. The 
other direction requires more effort. 


=" = feo ern 1, 
gcd(r, n) 


4.2 Moving on to Integral Cayley Graphs 


In view of Eq. (4.2) one can define a superclass of the integral circulants.** First fix 
a finite abelian group H of order n. Here we may assume H = Z,, x ... X Zm, for 
suitable m,+...-m, = n with gcd(m;, mj) < 2 (for all i A j). We define a tuple-wise 
greatest common divisor of given tuples x = (x1,...,X,) andm = (m,,..., my) by 


gcd(x,m) = (d),...,d,) with d; = gced(x;, mj), 


agreeing on gcd(0,m) := m. Armed with this definition, the definition of the sets 
G,,(d) readily extends to the new context (with respect to the chosen representation 
of H). 

The sets G,,(d) can be viewed as layers of integral symbol sets with respect to 
Zm, X ...X Zm,. More precisely, the sets G,(d) generate a Boolean algebra Boca(H) 
with respect to intersection, union and complement of sets. Using the elements of 
Bsca(H) as symbols, they all give rise to integral Cayley graphs on H. One can 
show that Bgcg is a subalgebra of the Boolean algebra B(/) that is generated by the 
subgroups of H (tacitly removing the neutral element of H from each set). Note that 


See, e.g., PJ. McCarthy, Introduction to arithmetical functions, Universitext, Springer, New 
York, 1986. 

43W. Klotz and T. Sander, Integral Cayley graphs defined by greatest common divisors, Electron. 
J. Comb. 18(1) (2011), Research paper P94. 
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S = S~ for all sets S C B(H) so that they indeed constitute valid symbols. Actually, 
all of the elements of the Boolean algebra B(#) give rise to integral Cayley graphs 
on H.4 

The Sudoku graphs are interesting examples of integral Cayley graphs. The board 
of an n-Sudoku consists of a square arrangement of cells, with n? rows and n? 
columns. The board is subdivided into an n times arrangement of square blocks 
with n* cells each. Successfully solving a (possibly pre-filled) n-Sudoku means 
assigning integers | to n to the cells in such a way that each row, cell and block 
contains distinct numbers. For n = 3 we obtain the classical Sudoku we know from 
our daily newspapers. 

Let us construct a graph associated with an n-Sudoku board. The Sudoku graph 
Sud(n) has n* vertices, representing the distinct cells of the board. Two vertices 
are adjacent if and only if they represent cells that are located in the same row, 
column or block. So the Sudoku graph represents the requirement of filling in 
distinct numbers. Consequently, solving an n-Sudoku puzzle is equivalent to finding 
a proper n-coloring of the graph Sud(n). 

The vertex set of Sud(n) may be identified with Z4. Let us assume that each 
vertex (x), X2,x3,X4) encodes the position of its associated cell as follows: (x1, x2) 
locates the block, by using x, as horizontal and x2 as vertical position index with 
respect to the block grid; (x3,.x4) works in a similar fashion and locates the cell 
within the selected block. Then we have*: 


Theorem 4.5 Sud(n) ~ Cay(Z4, §,;US,US3) for 


n? 


S| = {(0, 0, x3, x4) : X3,X4 € Zn, (x3, X4) # (0, 0)} 
= (Z, x Z, x Z, x Z,) \ (Z, x Z x Z x Z), 
So = {(0,x2,0,x4) : x2,x4 € Zn, x2 F O} 
= (Z, x Z, x Z, x Z,) \ (Z, x Z x Z, x Zn), 
S3 = {(x1,0,%3,0) : x1,x3 € Zy, x1 A O} 


II 


(Z;, x Zy x Zy x Zy) \ (Z, x Zi X Zn X Zi). 


We see that Sudoku graphs are indeed integral Cayley graphs—the sets S; are 
(disjoint) subgroups of H (neglecting the neutral element), so their union belongs to 
B(Z;). 


44W. Klotz and T. Sander, Integral Cayley graphs over Abelian groups, Electron. J. Comb. 17(1) 
(2010), Research paper R81. 

45W. Klotz and T. Sander, Wie kommt Sudoku zu ganzzahligen Eigenwerten? (How does Sudoku 
acquire integral eigenvalues ?), Math. Semesterber. 57(2) (2010), 169-183. 
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In many papers concerning integrality of Cayley graphs, group characters are a 
frequently employed tool. A character is a homomorphism of a group into the 
multiplicative group of C \ {0}. The use of group characters is induced by the fact 
that Cayley graphs are vertex transitive.*° It follows directly from the definition of 
a Cayley graph that the translations of its group do the trick here. Consequently, 
for spectral studies of Cayley graphs, one can employ a method outlined in a 1975 
paper by Lovasz.*’ When applied specifically to Cayley graphs, the method looks 
as follows. Let H be a finite group and y a character of H. Defining 


w(S) = Yo vs), 


ses 


note that 


Y> v@ = v(S)v) 


x€EN(v) 


for every vertex v of Cay(H,S), where N(v) is the neighbourhood* of v. It follows 
that, for any given symbol S, the function y is an eigenfunction of the graph 
Cay(H, S$) for eigenvalue w(S). More generally, one can look for sets T such that 
w(T) € Z for all n characters w of the given group H. 


4.4 Diving Deeper into Integrality 


Let us investigate the mutual relation of the Boolean algebras Byca(H) and B(H), for 
finite abelian groups H. One can show that equality By.q(H) = B(H) holds if and 
only if H can be represented in the form H = Zy x ... x Zy x Zs2*°. A Cayley 
graph on such a group A is integral if and only if its symbol set belongs to the 
Boolean algebra B(#). So here integrality is once again characterized by a certain 
composition of the symbol set. 

This takes us to the question of how to identify all symbol sets giving rise to 
integral Cayley graphs on some fixed group H. To this end, let us take a closer 
look at the involved Boolean algebras. How can they be generated? If His a finite 
abelian group of order n, again represented as a suitable product of cyclic groups, 


46Vertex transitive graphs are exactly those graphs whose automorphism group acts transitively on 
its vertex set. In other words, for any two vertices there needs to exist an automorphism mapping 
the first vertex to the second. 


471. Lovasz, Spectra of graphs with transitive groups, Period. Math. Hung. 6 (1975), 191-195. 
481. the set of all vertices adjacent to v. 
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then each element of By-q(H) is a disjoint union of the layers G,(d). These layers 
are the atoms’? of Beca(H). 

Actually, also the enclosing algebra B(H) is atomic, for any finite (not necessarily 
abelian) group H. Its atoms are exactly the generators of the cyclic subgroups of 
H. Each such atom gives rise’ to an integral Cayley graph on H. As an example, 
consider the cyclic group Z24. We have Gz4(4) = {4, 20}. This is an atom of B(Zz.) 
since its elements generate the cyclic subgroup (4) of Zo4. 

The collection of all symbol sets S for which Cay(H,S) is integral forms yet 
another Boolean algebra. Let us denote it by Bj, (H). Obviously, the atoms of B(A) 
are also elements of By,(H). Hence B(H) is a subalgebra of Bi, (H). Effectively, 
B(A) is generated by a subset of the atoms of By,(H). In the case of finite abelian 
groups H it turns out, however, that both algebras coincide”: 


Theorem 4.6 Jf H is a finite abelian group, then 
By (H) = B(A). 


So a Cayley graph on a finite abelian group H is integral if and only if its symbol 
set is a union of complete generating sets of cyclic subgroups of H. This fact is a 
beautiful extension of Theorem 4.4. 


The structure of the Boolean algebra By,,(H) has been studied quite well>!: 


Theorem 4.7 Let H be a finite group with center C. Then the power set P(H — C) 
constitutes a Boolean subalgebra of Byy,(H). Moreover, By,(H) is a direct product 
of B(C) and P(H — C). 


An immediate and important consequence of this theorem is that Bi,(H) = B(A) 
can only hold if the finite group H is abelian. So we know the limits of atomic 
construction of symbol sets. 

An interesting side question is whether there exist groups such that any symbol 
readily gives rise to an integral Cayley graph. Such groups are called Cayley 
integral. It turns out that Cayley integral groups are scarce. The nontrivial abelian 
Cayley integral groups are represented by** 


2” x 22, Z" x Zt, m,n> 0. 


49 An element a of a lattice with associated partial order < and least element 0 is an atom if 0 < a 
and no element x satisfies 0 < x < a. A lattice is atomic if, for every element b 4 0, we have 
a < b for some atom a. 

50 R.C. Alperin and B.L. Peterson, Integral sets and Cayley graphs of finite groups, Electron. J. 
Comb. 19(1) (2012), Research paper P44. 

5! R.C. Alperin and B.L. Peterson, Integral sets and the center of a finite group, Can. Math. Bull. 
57(1) (2014), 9-11. 
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In the non-abelian case the list is 
S3, Z3 ™ Z4, Og X Z5, n=O, 


where S3 is the symmetric group of order 3, the semidirect product Z3 » Z, is the 
dicyclic group of order 12 and Qg is the quaternion group of order 8. Coincidentally, 
the latter list has been discovered twice, independently and almost simultaneously.>? 


4.5 Playing with Atoms 


Returning to the Boolean algebra B(H), let us take a closer at its atoms and 
generating sets of cyclic groups in general. Here, interesting number-theoretic 
questions arise that can be investigated even beyond the graph-theoretic context. 
Fix a finite abelian group H and define 


atom(a) := {a’: (a'\ = (a)}.° (4.3) 


Let us assume in the following that H is an additive group (H, +). What happens if 
we add two atoms? More precisely, we ask about the structure of the set atom(a) + 
atom(b), where the set sum is defined in the usual way 


M,+M, = {m, +m: m €M,, m2 € Mp} 


for given sets M; and M). Surprisingly, it turns out that atom(a) + atom(b) € B(A) 
for any given pair a,b € H.* With respect to Cayley graphs we see that sums 
of atoms give rise to integral Cayley graphs. This comes in handy when studying 
distance powers of integral Cayley graphs. Distance powers of Cayley graphs are 
again Cayley graphs. It is not difficult to see that the symbol set of such a distance 
power is formed by successive addition of symbols of the original Cayley graph. 
Consequently, such a symbol set is a sum of (disjoint unions of) atoms. Hence it 
gives rise to an integral Cayley graph. 

In essence, any distance power is an overlay of d graphs where each contains 
exactly the edges representing distance d. Each individual layer is an integral 
Cayley graph in itself. Taking this construction to the extreme, one can even do 


52 A. Abdollahi and M. Jazaeri, Groups all of whose undirected Cayley graphs are integral, 
European Journal of Combinatorics 38 (2014), 102-109; and A. Ahmady, J.P. Bell, and B. Mohar, 
Integral Cayley graphs and groups, SIAM J. Discrete Math. 28(2) (2014), 685-701. 

This definition coincides with the concept of atoms in Boolean algebras discussed before. 

*4W. Klotz and T. Sander, Distance powers and distance matrices of integral Cayley graphs over 
abelian groups, Electron. J. Comb. 19(4) (2012), Research paper P25. 

55 The d-th distance power G of a graph G has the same vertex sets as G. Two vertices x # y are 
joined by an edge if and only if their distance in G is at most d. 
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the following: Fix a function m : N — No of multiplicities and construct a weighted 
distance power multigraph by having exactly m(d) edges between any two vertices 
of distance d. The result is an integral graph. 

For cyclic groups the sum of atoms has been studied even further, leading to 
interesting topics of number theory. Given a finite abelian group H and an element 
a € H, we have atom(a) = {a’ € H: (a’) = (a)} as in (4.3). Viewing Z,, in the 
context of the residue class ring Z/nZ we see that the cyclic subgroup (a) of the 
additive group Z, simply is the principal ideal (a) := aZ, generated by a. So the 
previous equation translates to 


atom(a) = {au: 1 <u<n, (u,n) = 1} 


= {ax: 1 <x < ord(a), (x, ord(a)) = 1}, 


where ord(a) = Ga denotes the order of a in the additive group Z,, 

Beyond merely claiming that atom(a) + atom(b) is a disjoint union of atoms, one 
can precisely state for each element y € atom(a) + atom(b) its exact number”? of 
possible sum representations a + 6, with a € atom(a), 8 € atom(b). This has been 


generalized to sums of any number of atoms,’ and to abelian groups.°® 


5 Primality 


There is indeed a concept of primality in graphs. For this purpose it is convenient 
to assume the graphs to be friendly in the sense that they are undirected, connected 
and without leaves, i.e. without vertices having just one neighbour. Friendly graphs 
G contain cycles,°’ and each closed walk which is not the repetition of a shorter 
closed walk is called a prime in G.® Let us illustrate the concept by an example, 
where the graph G simply consists of two cycle graphs C; and C), say, which have 
exactly one vertex v in common. Then both C; and C) are primes in G, but walks 
two or more times along C; or along C, are not. However, a walk starting at v, 


56 JW. Sander and T. Sander, Adding generators in cyclic groups, J. Number Theory 133(2) (2013), 
705-718. 


57 Q.-H. Yang and M. Tang, On the addition of squares of units and nonunits modulo n, J. Number 
Theory 155 (2015), 1-12. 

58M. Kreh, Adding generators in abelian groups, to appear in J. Algebra and Number Theory Acad. 
*°This follows from the fact that trees, by definition, are the cyclefree graphs and must have leaves. 
Here we disregard the technical obstacles of orientation and backtracks, but one can find all 
details of the precise definition of primes in graphs—and most of the other facts of this section—in 
A. Terras, Zeta Functions of Graphs—A Stroll through the Garden, Cambridge studies in advanced 
mathematics, Cambridge University Press, Cambridge, 2011, or the first author’s Arithmetical 
Topics in Algebraic Graph Theory, in: K. Bringmann, Y. Bugeaud, T. Hilberdink, J.W. Sander, 
Four Faces of Number Theory, European Mathematical Society, Lectures in Mathematics, Ziirich, 
2015. 
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following C, twice and then running along C2 once is a prime, while a walk along 
Ci, C2, C1, C2 is not (since it is the repetition of the prime walk C,, C2). All walks 
Ci, Cr, Co,..., Cy are apparently primes. 

If P is a prime in G, we denote by v(P) the length of the corresponding walk 
and call it the length of the prime. It is easy to see that a friendly graph which is 
not a cycle itself contains at least two different cycle subgraphs and thus infinitely 
many primes, obtained by taking ‘non-periodic’ walks using at least two of the cycle 
subgraphs. Let Pg be the set of primes in G. Then 


tg(k) := #HPeE Pg: v(P) =k} 


is called the prime counting function.*' A crucial parameter with respect to mG (k) is 
Ag := gced{v(P): P € Pg}, since we clearly have mg(k) = 0 in case Ag + k. 

The Riemann zeta-function encodes information about the distribution of primes 
in the set of positive integers and can be used to prove the prime number theorem. 
The equivalent for primes in graphs and the prime counting function zg is the 
Ihara zeta-function, first defined by Yasutaka Ihara® in the 1960s in the context 
of discrete subgroups of the two-by-two p-adic special linear group. In his book 
Arbres, Amalgames, SL°* Serre suggested in 1977 that Ihara’s original definition 
could be reinterpreted graph-theoretically. In 1985 Toshikazu Sunada® put this 
suggestion into practice and defined the Ihara zeta-function of a friendly graph G as 
the complex function 


tou) = T] G-w®)", 


PEePg 


where u € C is supposed to have sufficiently small absolute value for reasons of 
convergence. As other zeta-functions, the Ihara zeta-function satisfies a functional 
equation® and, in addition, the so-called determinant formulae, depending on the 
edge adjacency matrix W,°' of the graph G. It turns out that the absolute value of 
the largest eigenvalue of Wg equals the reciprocal of the radius Rg of convergence 
of ¢g. Let us denote by Ag the second largest modulus of the eigenvalues of Wg. 


61 Observe the = sign as opposed to < in the prime counting function for integers. 
®¥y. Thara, On discrete subgroups of the two by two projective linear group over p-adic fields, J. 
Math. Soc. Japan. 18 (1966), 219-223. 

3— J.-P. Serre, Trees, Springer, New York, 1980. 


4Namely, as a zeta-function related to closed geodesics in graphs—compare this with Selberg 
zeta-functions. 

6ST. Sunada, Riemannian coverings and isospectral manifolds, Ann. Math. 121 (1985), 169-186. 
Tn fact, there exist several functional equations for €g(u) 

67Wg is a 0-1-matrix indicating if a pair (e;, e;) of edges in G is a path (of length 2) in the graph or 
not. In other words, the edge adjacency matrix of G is the adjacency matrix of the line graph of G, 
which is obtained from G by exchanging the roles of vertices and edges. For a precise definition 
look into the books cited in footnote 60. 
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By somewhat imitating methods from analytic number theory to prove the classical 
prime number theorem,® one obtains the prime number theorem for graphs”: 


Theorem 5.1 Let G be a friendly graph, and let k be a positive integer. If Ac + k, 
then mG(k) = 0. For Ag | k, we have 


n(k) = AcX& +0(42) +0(82). (5.1) 


—k 
In particular, we have 1¢(k) ~ Aga& asymptotically for Ag | k with k > oo. 


It is well known that the Riemann hypothesis (for the Riemann zeta-function) 
implies the best possible error term in the prime number theorem for integers. There 
exists a corresponding Riemann hypothesis on the location of the zeros of the Ihara 
zeta-function. Looking at formula (5.1), it becomes clear that best possible error 
terms are obtained for graphs G for which the spectral gap between the largest 
eigenvalue of Wg (which equals R7!, as mentioned above) and the second largest 
eigenvalue Ag of Wg is as big as possible. For a regular graph,’° the spectrum of 
its edge adjacency matrix is closely related to the spectrum of its (vertex) adjacency 
matrix.’! In particular, the corresponding spectral gaps between largest and second 
largest eigenvalues are correlated. Having all this in mind, the following fact seems 
not to be very surprising: Among all friendly graphs G, at least if they are g-regular, 
the ones whose Ihara zeta-function satisfies the corresponding Riemann hypothesis 
are exactly those for which the spectral gap with regard to the adjacency matrix 
A(G) is as large as possible. Observe that g-regular graphs have largest eigenvalue 
q in modulus.” 

By a result of Alon and Boppana’’ we know for g-regular graphs G that the 
second largest eigenvalue A(G) in modulus cannot be essentially smaller than 
2./q—I. Graphs where A(G) < 2./q—TI are known as Ramanujan graphs."* 
Therefore, the Riemann hypothesis for the Ihara zeta-function of a graph G holds 
precisely if G is a Ramanujan graph. 


*8But with less difficulties and, therefore, much less effort. 

For all the details, see the two books mentioned in footnote 60. 

A graph is called g-regular if every vertex has exactly g neighbours. 

"].e. given the spectrum of any regular graph, the spectrum of its line graph can be explicitly 
specified. This follows from a result of Sachs, see, e.g., Theorem 3.8 in: N. Biggs, Algebraic graph 
theory, 2nd ed., Cambridge University Press, Cambridge, 1993. 

™This is an exercise in algebraic graph theory. 


BN. Alon, Eigenvalues and expanders, Combinatorica 6 (1986), 83-96, and R.B. Boppana, 
Eigenvalues and graph bisection: an average case analysis, pp. 280-285 in: Proc. 28th Annual 
Symp. Found. Comp. Sci., IEEE 1987. 


74For a survey, see M.R. Murty, Ramanujan graphs, J. Ramanujan Math. Soc. 18 (2003), 33-52. 


Recent Developments on the Edge Between Number Theory and Graph Theory 423 


Ramanujan graphs are a class of special expander graphs,” having applications 
in communication network theory’® and theoretical computer science.’’ In 1988 
Lubotzky et al.”* constructed an infinite family of (¢+ 1)-regular Ramanujan graphs 
(with growing number of vertices) for each prime g = 1 mod 4. Their proof used the 
Ramanujan conjecture, which led to the name of Ramanujan graphs. Morgenstern”? 
extended the construction of Lubotzky et al. to all prime powers in 1994. The 
problem of such an explicit construction for general g is open. 


6 Multiplicativity 


Arithmetical functions®® were one of Wolfgang Schwarz’s main research areas, and 


multiplicativity is the most important and useful number-theoretic property such a 
function can have. As we shall see, the concept of multiplicativity is also profitable 
with respect to certain graph-theoretical problems. 

In 1978 Gutman*! established the mathematical notion of the energy 


BG > al (6.1) 


A€Spec(G) 


of a graph G as the sum of the absolute values of all eigenvalues of G (counted with 
multiplicities). This terminology has its roots in mathematical chemistry way back 
in the 1930s and is related to certain chemical parameters, e.g. the heat of formation 
of a hydrocarbon.®** By now this concept of graph energy, which is accompanied 


Expander graphs have the following two competing properties: on the one hand, they are fairly 
sparse (in terms of number of edges as compared to the number of vertices), but on the other hand, 
they are highly connected. For a formal definition as well as results and applications, see S. Hoory, 
N. Linial, and A. Widgerson, Expander graphs and their applications, Bulletin (New series) of the 
American Mathematical Society 43(4) (2006), 439-561. 

°F Bien, Construction of Telephone Networks by Group Representations, Notices of the Amer. 
Math. Society 36(1) (1989), 5—22. 

See Hoory et al., footnote 75. 
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by others, has evolved into a significant, much studied mathematical object.** For 
a while it was thought that the maximal energy a graph with n vertices could have 
is 2(n — 1), which incidentally is the energy of the complete graph K,,. This turned 
out to be false, but it triggered the search for high-energy graphs in certain classes 
of graphs. 

Using the characterization of integral circulant graphs ICG(n, D) by the two 
parameters n and D, where n is the number of vertices of the graph and D is a 
set of divisors of n,°* we abbreviate the energy of ICG(n, D) by setting E(n, D) := 
E(ICG(n, D)). We ask for the maximal energy Emax (n), taken over all divisor sets D 
of n for any fixed n. Observe that, since all eigenvalues of ICG(n, D) are integral, the 
energy E(n, D) is an integer as well. As a by-product, the authors’ energy formula®> 
revealed the arithmetical fact that 2(p — 1) | E(p*, D). More importantly, Emax (p*) 
could be determined explicitly, and the divisor sets D maximizing the energy were 
found®® by combinatorial arguments. 

Due to the lack of a closed energy formula for E(n, D), it seemed much more 
difficult to deal with E,,,x(m) for arbitrary n. A crucial obstacle is the problem that 
E(n, D) reveals no signs of multiplicativity with respect to n. This deficiency was 
overcome by developing and using the concept of multiplicative divisor sets.8’ A 
divisor set D of n is called multiplicative if 


D= || D,. 


peP, pin 


where D, contains all powers of the prime p | n which divide some element of D, 
and the product of sets A;,..., A, of numbers is defined by 


] [4 =41--.. Ae = fare... ae: a, € Aj,..., a € Ax}. 


certain finite, planar graphs, pp. 87-114 in: R.J. Wilson ed., Appl. of combinatorics, Shiva Math. 
Ser. 6, Shiva Publ., Nantwich, 1982. 

83For mathematical surveys, see R.A. Brualdi, Energy of a graph, AIM Workshop Notes, 2006 and 
I. Gutman, The energy of a graph: Old and new results, pp. 196-211 in: A. Betten, A. Kohnert, R. 
Laue, A. Wassermann eds., Algebraic Combinatorics and Applications, Springer-Verlag, Berlin, 
2001. 
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For multiplicative divisor sets D of n = p}'---p;‘, we have the desired multiplica- 
tivity of the energy,** namely 


E(n, D) = 7 E(pi', Dp;) 
i=1 


By restricting the divisor sets to those which are multiplicative, the following bound 
could be shown.*? 


Theorem 6.1 Let n be a positive integer. Then 


youd) 


Emax(n) <n ar nt(n)’, 


d\n 


where ~ denotes Euler’s totient function, t(n) the number of positive divisors of n, 
and w(n) the number of distinct prime factors of n. 


Multiplicative divisor sets are also useful when approaching the problem to 
determine the integral circulant graphs being Ramanujan graphs”? at the same 
time. Again, the answer for ICG(p’, D) could be given completely and explicitly.”! 
In the general case ICG(n, D), a certain arithmetic feature, namely n having a 
comparatively dominating prime power factor, is crucial”: 


Theorem 6.2 Let n > 3 be an integer. 


(i) For even n, there is always a multiplicative divisor set D of n such that 
ICG(n, D) is a Ramanujan graph. 

(ii) Ifn is odd and has a prime power factor = wt = 
divisor set D of n such that \CG(n, D) is a Ramanujan graph. 


then there is a multiplicative 


(iii) If the largest prime power factor of some odd n = 8259 is smaller than aa Cee 


then there is no multiplicative divisor set D of n such that \CG(n, D) is a 
Ramanujan graph. 


88Theorem 4.1 in footnote 87. 
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Linear Alg. Appl. 437 (2012), 1408-1421. 
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Abstract The leading coefficient of the nth Stern polynomial defined by KlavZar et 
al. (Adv Appl Math 39:86—95, 2007) is expressed in terms of the binary expansion 
of n. 
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We shall use the notion of Stern polynomials B,,, introduced by KlavzZar, Milutinovi¢é 
and Petr [1], i.e. By = 1, Boy(x) = xB, (x), Bont) = Bn + Bn41. Using the results 
of [2] and denoting by F’, the Fibonacci sequence we shall prove 


Theorem 1 [fn has binary expansion 


a, az 


HEI al kee 0), (1) 


then the leading coefficient c, of By is given by the formula 


I 
cn = | [N(Ad. (2) 
i=1 
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where A; are the blocks of \’s and 2’s occurring in the sequence az,..., ax. N(Ai) 
are given by the formulae valid for integers l,m, where | = 0, m = 0 in (3) and (4); 
1> 1, m> 1 in (5) and (6), 


nae 1 if Lis odd, 


j oo (3) 
a +1. if lis even; 
N(2) = Fina; (4) 
and 
m—1 
m A, 2 if Lis odd, 
N(A, 2, = - ae m v ee (5) 
SN(A, 2)+N(A,2)_ if Lis even; 
l 
lm FitiN(A, | if Lis odd, 
N(A, 1,2) = va) ere (6) 


l 1 
Fint- N(A, 1) + FinN(A, 1) if Lis even, 
where A is a block of \’s and 2’s either empty or ending with 1 or 2 in case (5) 
or (6), respectively. 


Example Ifn = 2”+® +2 —1 (ay, a3 positive integers), then c, = 3 forn = 19; 
Cn = 2 forn = 5, 27? + 3 (ay > 2), 2877 + 28 — 1 (a3 > 2); c, = 1, otherwise. 


Corollary 1 For n odd, B, is monic if and only if (1) holds and the blocks of \’s 


and 2’s occurring in the sequence ao, ... , ax have the form 
mo ly my L In Mn 
12 bad, 


where 1; is odd,0 < my) < 1,1 <m;<2(0<i<k),0<m<1(k> 0). 


Theorem 2 Let an integer m > 0 be given. For almost all n, in the sense of density, 
Cn = 0 (modm). 


Corollary 2. For almost all n, in the sense of density, B, is not monic. 


Corollary 2 has been conjectured by M. Gawron (written communication). 
For the proof of Theorem | we need a definition and five lemmas. 


Definition 1 Let for a block A of 1’s and 2’s occurring in the sequence ap, ... , ax 
on the positions a+ 1,...,b (1 < a <b) N(A) be the number of integer sequences 
i, (1 < A < A) such that 


a<i+2<b (A< A), (7) 
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and forA < A 
hei 2iAt+2 (8) 
and 
2 ly+1 Ly +1 
1 J 
Ye - a4) =| [+.-+[4—], (9) 
2 2 

A=1 

where L),..., ZL, are the lengths of the blocks of 1’s contained in A. 


Lemma 1 (3) holds. 
Proof In this case the condition (7) and Eq. (9) reduce to 


a<it+2<a+lI14l (10) 
and 
I+1 
an} ay 
respectively. If / is odd, the only sequence satisfying (8) and (10)-(1 1) is 
i =a+2A-3 (l<A< A), 
while if / is even the sequences satisfying (8) and (10)-(11) are 
ip =at+2A-3(1 <A <h), 2h =a+2h-2(h<A< A) 


forh =1,...,4 +1. Thus (3) follows. 
Lemma 2 The number of integer sequences i, (1 < 4 < A), including the empty 
sequence, such that (8) holds and 


l<i<m (m>0,1<A< A) (12) 


equals Fiy+. 


Proof The lemma is true for m = 0, where only the empty sequence satisfies the 
conditions (8) and (12) and for m = 1, where there is only the empty sequence and 
the sequence {1}. Assume now that the lemma is true for all m < n > 2. Then every 
sequence satisfying 


I< sn (lSA< A) 
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has either 

in <n (13) 
or 

v2. (14) 


The number of sequences satisfying (8) and (13) is, by the inductive assumption, 
F’,+41, the number of sequences satisfying (8) and (14), since i,_; < n — 2 is, by the 
inductive assumption, F,,. Therefore, the total number is F,, + Fy41 = Fn+2, which 
completes the inductive proof. 


Lemma 3 Equation (4) holds. 


Proof In this case the condition (7) and Eq. (9) reduce to 
a<it2<at+m (<A) 


and (4) follows from Lemma 2. 
Lemma 4 Equation (5) holds. 
Proof Let the length of the block A be @ and the lengths of the blocks of 1’s 


contained in A be Lj,..., Lj-;. Then the conditions (7) and Eq. (9) reduce to 
i, +2 EA (a<ip+2<a+a), (15) 
dj42=2 (ata<i+2<a+a+4+m), (16) 
Qyt2=1 (Qtat+tm<ip+2<a+a+m4+), (17) 
I; +1 Lj-1+1 
C=) =|] +...+ |], (18) 
2 2 
a<i,+2<a+a 
I+1 
PS (2 — 4,42) = | |- (19) 
ata+m<i,+2<a+ta+m4l 


Let the least A satisfying (17) be Aj. If / is odd, the only sequence i, (A > A1) 
satisfying (8) and (17), (19) isi, =a+a+m-+ 2(A — Aj) — 1 and thus, by (8), 
either A; = 1, orig,-1 +2 <a+a+m-— 1. Therefore, 


m 1 m—1| 
N(A, 2,1) = N(A, 2 ). (20) 
If / is even and (8) holds, then either A; = 1, or 


in, =ata+m+1>ig-14+2, (21) 
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or 


ia; Data+m. (22) 


The number of sequences i, (A > Aj) satisfying (8), (17), (19) and (21) is, 
by Lemma 1, i, on the other hand, the number of sequences i, (A < Aj) 
m—1 
satisfying (8), (16) with a+a-+m replaced by a+a+m-—1and(18)isN(A, 2 ). 
The number of sequences i, satisfying (8), (17), (19) and (22) is one, on the other 


hand, the number of sequences i, (A < Aj) satisfying (8), (16) and (18) is N(A, 2). 
Therefore, 


m 1 1 m—1 m 
N(A,2,1) = 5M(A, 2) +N(A,2) (23) 


and (5) follows from (20) and (23). 
Lemma 5 Equation (6) holds. 


Proof Let the length of the block A be @ and the lengths of the blocks of 1’s con- 
tained in A be L),..., 7-1. Then the condition (7) and Eq. (9) reduce to (15), (18), 


djt42=1 (ata<ij+2<ata+)), (24) 
di42=2 (atatl<i+2<a+a+l+m), (25) 
I+] 

FE e-aa=|44) 29 


ata<i,+2<atat+l 


Let the least A satisfying (25) be A. If lis odd, then (8) and (26) imply that A2 > 1, 
ian +2=atat+/l<ing,, 


thus, by Lemma 3, the number of sequences i, (A > Az) satisfying (8) and (25) is 


Fn41. On the other hand, the number of sequences i, (A < A2) is N(A, i), thus 
lm 1 
N(A, 1,2) = Fin4 i N(A, 1). (27) 
If / is even, then either 
ing Patatl (28) 
or 


in, =atatl—-1. (29) 
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The number of sequences i, (A > A2) satisfying (8), (25) and (28) is, by Lemma 3, 
Fin41. The number of sequences i, (A < Az) satisfying (8), (15), (18) and (24) is 


l 
N(A, 1). The equality (29) implies by (8) i4,41 + 2 > a+a-+/1-+ 3, thus the 
number of sequences i, (A > A2) satisfying (8), (25) and (29) is, by Lemma 3, Fin. 
On the other hand, by (8), i4,-1 < a+a-+/-—1, hence the number of sequences i, 


i-1 
(A < Az) satisfying (8), (15), (18) and (29) is N(A, 1 ). Therefore, 


lm 1 l-1 
N(A, 1,2) = Fine 1N(A, 1) + FnN(A, 1) (30) 


and (6) follows from (27) and (30). 


Proof (Proof of Theorem 1) By Theorems 1 and 2 and Lemmas 5 and 6 of [2], cy is 
cardinality of the set S of integer sequences i, (1 < A < 4) such that 


lL 

l 1 +1 

i, >0, (8)holds forall A <j and Ye-ai42) =| |e-4[4 i; 
4=1 


where /},...,/; are lengths of blocks of 1’s occurring in the sequence a,... , a. 
Since the right-hand side is the maximum taken by the left-hand side for all 
sequences i, satisfying (31) we have for i, € S, aj,42 < 2 and 


I 
IS| =] [N@o, 
i=1 
where A; are blocks of 1’s and 2’s occurring in the sequence a2,..., a, and N(A;) 


are defined above. Theorem | follows now from Lemmas 1, 2, 4, 5. 
Proof (Proof of Corollary 1) This follows at once from Theorem 1. 
For the proof of Theorem 2 we need a definition and a lemma. 


Definition 2 Let n satisfy (1). J(n) is the number of the blocks A;(7) of 1’s and 2’s 
contained in a2,..., dx. 


Lemma 6 For every fixed integer 1 > 0 the asymptotic density of odd n such that 
2 
for a certain i < J(n): Aj(n) = 1 is 1/2. 


Proof Suppose that the lemma is false. Then there exists ¢ > 0 and a sequence 
xy such that x, — oo and the number of odd n < x, such that for all i < J(n): 


2 
Aj(n) # 1 is at least ex,. Put s, = ey ]. We may assume that for all v 


Suit S sy WG. (32) 
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The set S, of odd a < 2° such that for all i < J(a): Aj(a) 4 1 satisfies 
2! > [Sy] > 221. (33) 
Let T,,(x) be the set of all n < x such that 
ns DFS 4 gsv +244 4 gsvt2i+3 
$e vttl 4 atl 4 4 a4+3 4 g (mod 24245) ae Sy. 


We have by (33) for all v 


x Ex Ka 
IT, (x)| > ISvI( sare — 1) > x7 2 . (34) 
Distinct sets 7, (x) are disjoint. Indeed, if v > jz and 
né€ T(x) NT, (@) 
then by (32) and (34) 
resnmod2°" = Qut2l+5 a Qut2l+4 ai DWut2l+3 
+ Qsut2l+1 oe Qsut2-1 A+ Qsut3 +a (mod igre. ae Le 


thus the binary expansion of res mod 2°” contains the segment 


11101...01000 
rer corre 
21 


contrary to the definition of S,,. Put 
DEFT 


p(y, a 
€ 


Then, since the number of odd n < x is at most ae we have by (34) 


y2I+6 


et E E 
i 2 IU T,(0)| = S°IT.0| 
v=1 v=1 
Xx EX 


ye) a anaes 


ry Xx 
2247 2 se ea 


The obtained contradiction completes the proof. 
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Remark 1 One can prove similarly that for every block A of 1’s and 2’s with 0 < 
|A| < oo the asymptotic density of n with i < J(n) and Aj(n) = A is 1/2. 


Proof (Proof of Theorem 2) Taking in Lemma 6: 1 = m— 1 (m > 1) we obtain from 
Theorem | that the asymptotic density of odd n such that c, = 0 (modm) is 1/2. 
It follows that for every e > 0 and for every positive integer k the number of odd 
n < + such that c, = 0 (modm) is > (1 — e)x/2@+) for x > xo(e,k). Without 
loss of generality we may suppose that xo(e, k) is increasing in k. Since co, = Cn, 


taking k = Las we obtain that for x > xo(e, k) the number of n < x such that 


Cn = 0 (mod) is at least (1 — =r —&)x > (1 — 2¢)x, which proves the theorem. 


Proof (Proof of Corollary 2) It suffices to take in Theorem 2: m = 2. 
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Abstract We show that universal elliptic Carmichael numbers do not exist, answer- 
ing a question of Silverman. Moreover, we show that the probability that an integer 
n, which is not a prime power, is an elliptic Carmichael number for a random curve 
E with good reduction modulo n, is bounded above by O(log! n). If we choose 
both n and E at random, the probability that n is E-Carmichael is bounded above by 
O(n"! / aa 
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1 Introduction and Results 


Let E be an elliptic curve defined over Q. Let L(s, E) = > 5 onlE ) be the L-series 
associated with E. If p is prime, then |E(Z/pZ)| = p — a,(E) + 1. Hence, if 
a,(E) is computable, checking whether |E(Z/nZ)| = n — a,(E) + 1 is a necessary 
criterion for n to be prime. Unfortunately the order of E(Z/nZ) is not directly 
accessible, so in practice one chooses a point P on the curve, and checks whether 
(n—a,(E) + 1)P =0. If this condition is satisfied for every P € E(Z/nZ), but n is 
not a prime power, we say that n is a Carmichael number for the curve E. 
Silverman [9] defined a universal elliptic Carmichael number to be an integer 
n, which has at least two different prime factors, but is a Carmichael number 
for every elliptic curve FE, which has good reduction modulo n, and asked whether 
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universal elliptic Carmichael numbers exist. Here we show that such numbers do 
not exist. In fact, we can explicitly give parameters a,(E), which imply that n is not 
an elliptic Carmichael number for the curve FE. While very explicit, this proof leaves 
the possibility that an integer n is elliptic Carmichael for most curves. Therefore we 
are also interested in the probability that an integer n is elliptic Carmichael for a 
random curve E. We prove the following: 


Theorem 1.1 Let n be an integer, which is not a prime power. 


1. There exists a curve E with good reduction modulo n, such that n is not an elliptic 
Carmichael number for E. 

2. As n tends to infinity, the probability that n is elliptic Carmichael for a random 
curve E is O(log! n). 

3. Ifn € |x, 2x] is chosen at random, and E is a random curve, then the probability 
that n is E-Carmichael is O(x~'/8+¢), 


The implied constants are absolute and not too large, and come mostly from 
replacing terms of the form log log n by log‘ n. 

Note that Luca and Shparlinski [5] considered the dual question to part 2 of 
Theorem 1.1, i.e. if E is a fixed elliptic curve and n is chosen at random, then the 
number of E-Carmichael integers in [x, 2x] is Orion)” The argument involved is 
quite different from our arguments, in particular, we do not have to consider twists 
L(s, E, x) of the L-series associated with E. 

Our notation follows the standard for analytic number theory. We denote by log, 
the k-fold iterated logarithm, e.g. log, n = loglogn, by w(n) the number of prime 
factors of n, P+ (n) the largest prime factor of n. For an integer n = [| p;' we call 
y(n) = [[p; the squarefree kernel of n. For an integer n and a prime number p, 
denote by v,(n) the exponent of p in the prime decomposition of n, that is, the 
largest k such that p*|n. For a group G we put exp(G), the exponent of G to be the 
least integer n > O, such that g” = 1 for all g © G. We use < as a synonym of 
= O(...), an index at a Landau symbol indicates that the implied constants depend 
on the index. 


2 Preliminaries 


In this section we collect results on the L-series of an elliptic curve and on 
multiplicative number theory. 
Our first two statements are well known, and can, e.g., be found in [8]. 


Lemma 2.1 The function n +> a, is multiplicative, and satisfies \ay| < 2°” Jn. 
For each p there exists a complex number ay of modulus ,/p, such that a,x = ak + 


Qa”. 


Lemma 2.2 The group E(Z/pZ) is a two-generated abelian group, and we have 
exp(E(Z/p*Z)) = p‘! exp(E(Z/pZ)). 


The Non-existence of Universal Carmichael Numbers 437 


Define the function e : N — N to be the multiplicative function satisfying 
e(p*) = plki71, 
Lemma 2.3 Let G be a two-generated abelian group. Then e(|G|)| divides exp(G). 


Proof It follows from the classification of finitely generated abelian groups, that G 
can be written as G Y Z/mZ ® Z/n2Z with nj|nz. Clearly exp(G) = nz, and we 
have that if p*|n;nz, then p!*/2!|nz. Hence our claim follows. 


We will repeatedly use the following alternative definition of an elliptic 
Carmichael number, which is [9, Proposition 12]. 


Lemma 2.4 An integer n is elliptic Carmichael for the curve E if and only if E 
has good reduction modulo n, and for each prime divisor p of n we have that 
exp(E(Z/p’?\™Z)) divides a — dy + 1. 


Deuring [2] determined the number of curves modulo p having prescribed order. 


Lemma 2.5 Let p be a prime. Then the number of curves E modulo p with 
|E(Z/pZ)| = p — ay + 1 equals H(4p — a), where H(n) is the Kronecker class 
number, which can be computed as follows. Write n = n'f?, where n' is squarefree 
and (n', f) = 1. Then 


mn) = 40(1.(5)) wn 


ms 


where w is the multiplicative function defined by 


I if (7) = 9. 
VP) = 41, if (2) =1, 
ee em 


The function wy satisfies 1 < W(f) « log3f. 

Using this result Lenstra [3] proved the following: 
Lemma 2.6 Let p be a prime, S © [p — 2./p,p + 2,/p] a set of integers. Then the 
probability that an elliptic curve E chosen at random satisfies |E(Z/pZ)| € S is 
bounded above by O (4 log p logs P). 


If |S| is not too small, we can do better. 


Lemma 2.7 For every fixed c we have that if p is a prime, and S © [p — 2./p,p + 
2./p] a set of integers satisfying |S| > p‘, then the probability P that an elliptic 
curve E chosen at random satisfies |E(Z/pZ)| € S satisfies 


RY 
P<, a) log3 p. 


VP 
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IfS S [p— (2—c)./p,p + (2—¢)./p], we also have 


[S| 


P>.—— 
4/P log; p 


For the proof of Lemma 2.7 we need some zero density estimate. Since we are not 
interested in the implied constants, any result would do. We choose the following, 
because it is at the same time quite simple and very well known. We refer the reader 
to the Montgomery’s book [6] for more detailed information. 


Lemma 2.8 For an integer Q, a real number T > 1, and a real number o > 1/2 
define N(o, Q,T) to be the number of pairs (p, x), where Rep > o, |Imp| < T, 
x is a primitive character to a module q < Q, and L(p, x) = 0. Then we have 
N(o,Q,T) « (PT) 34-0) log© (QT) for some constant C. 


Proof (Proof of Lemma 2.7) \t suffices to show that for all gq < Q = p + 2./p with 
at most p* exceptions we have that 


logs! cL (1 (<)) log, q. (2.1) 


In view of Lemma 2.8 it suffices to prove that (2.1) holds true under the assumption 
that L (s, (=) has no roots in the domain Res > 1 — c/8, |Ims| < Q. But under 
these assumptions (2.1) was essentially shown by Littlewood [4]. 


Lemma 2.9 There exists an absolute constant c > 0, such that for all prime 
numbers p € [x,x?] with at most one exception we have that the probability that 
a,(E) = 1 holds true for a random curve E is > Trice” 


Proof This follows immediately from Lemma 2.5 and the fact that there is at most 
one modulus q € [x, x] for which a Siegel zero exists. 


We shall also use the following consequence of Baker’s bound for linear forms 
in logarithms: 


Lemma 2.10 Let p be a prime, E an elliptic curve. Then we have a,x # 1 for all 
k > 3-107 log p(46 + log, p). 


Proof Define a, as in Lemma 2.1. If a,« = 1, then ak = 1 =f"; thus 
0 < A= |kloga — klog@ — log(—1)| < |Ja|~* = p-?. 


Here a, @ are algebraic numbers of degree 2 and height < p, hence from the work 
of Baker and Wiistholz (confer, e.g., [1, Theorem 2.5]) we obtain 


A > exp(—96!” log” p max(1, log k)). 
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Comparing these bounds we obtain 


kl 
oer < 96'° log” p max(1, log k), 


< 96!° log p, 
Tek 


which implies in particular k < 3- 107° log p(46 + log, p). Hence our claim follows. 


Lemma 2.11 Let p be a prime, k an integer. Then there are < k integers a,... , Ot, 
such that for all a and all elliptic curves E we have that a,«(E) = a implies a)(E) € 
{Q@1,..., x}. Similarly there are k integers, such that for all a and all elliptic curves 
E we have that a,«(E) = a (mod p) implies (ap(E) mod p) € {a,..., &k}. 


Proof Define a as in Lemma 2.1. There are two complex numbers with modulus 
p*/? and real part a/2. We may replace w by @, and may therefore assume that or“ 
is uniquely determined by a. Hence there are k possible choices for @ realizing a, 
and our first claim follows. The second claim follows similarly by considering F,,2 
in place of C. 


Lemma 2.12 The number of integers n € |x,x + ./x], which satisfy e(n) < } is 
ore + x!/3 log? x). 

Proof Write n = ab*c*, where a contains all prime divisors p of n, such that p* + n, 
c contains all prime divisors of n, which divide n with an odd exponent > 3, and 


b = V/n/ac?. We have e(n) = n/bc, hence the number of integers n € [x,x + ./x] 
satisfying e(n) < 2 equals 


x+ Jx x 
yas bic 3 *|- l= ;|= y= we 3 ~ +4+0(M"?) 
be>k be>k 
b?3<M 


+ O({ne [x,x + /x] : db,c: be>k, b’o>M, b’c3|n}), 


where M is a parameter to be chosen later. The first sum is O(-£). To estimate 
the set in the second error term consider all integers n € [x,x + ./x], which can 
be written as n = ab’c?, where b?c? > ./x, anda € [A,2A], b € [B,2B],c € 
[C, 2C]. Clearly the whole range can be covered by O(log? x) such intervals, hence, 
it suffices to estimate the maximum of the number of such n over all A, B, C with 
2>B2C > ./x. If two of the three integers a, b, c are determined, and M > ./x, then 
there is at most one choice of the third such that ab2c? € [x,x + ./x], thus we obtain 
that the last error term is 


< log® x max{min(AB, AC, BC) : AB’C? < x, B°C? > M}. 
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It is easy to see that the maximum is attained for AB?C*? = x, hence the second 
factor becomes 


max{min(xB/M,xC/M, BC) : B°C? > M}. 


The function is non-decreasing in both B and C, hence, we may assume that B?C? = 
M, and we obtain that the last quantity is 


max{min(x/(BC°), x/(B?C7), BC) : B’C? = M} « x13 


since we can neglect the first term in the min and the side condition. Hence our 
claim follows. 


3 Integers with Special Multiplicative Structure in Short 
Intervals 


In this section we estimate the number of integers n in an interval of the form [x,x+ 
c./x], which satisfy certain constraints concerning their prime factorization. 


Lemma 3.1 Let P be a set of prime numbers, and let x be a sufficiently large real 
number. Then the number N of integers n in the interval [x,x + ./x] which can be 
written as dt, where d < x?/> and t contains only prime factors from the set P 
Satisfies 


x log |P| 


ogx 


N<|P|+ 


If |P| < ee thenN < |P\. 


Proof Let N © [x,x + ./x] be the set of integers which contain only prime factors 
from P. We have 


I] n < 4 2IN1/3 I] I] pe, 


neN pEP neN 


For a prime number p € P the highest power of p dividing an element of [x, x + ./x] 
is <x-+ ./x. Subtracting this element from all other elements of NV we see that 


Suis MEO Pome (n[ 


neN - logp k>1 
l 
logp P 
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On the other hand, we have The yn x" If we compare these bounds we obtain 


lo 
Nlogx!/? < |Pllog(x + vx) +N S> logp+ Vx D> =" 
pEP pEP 


p<a/x/N p>J/x/N 


< 2|Pilogx + (1+ o(1))N log 4 + (1 + 0(1))V/x log |P| 


IfN> ae. the first claim holds true anyway. Otherwise we obtain 


log,x | x log |P| 
logx logx 


> 


N < 6|P| + 2N 


logy x 
4logx’ 


the prime number theorem to be < (; + o(1)) log x. Hence 


and the first claim holds true again. If |P| < 


we estimate per log p using 


1 1 
3 osx < 2|P|logx + 7 losx, 


which implies N < 24|P|. 
Next we prove the following: 


Lemma 3.2 There exists some c > 0, such that for x sufficiently large there are at 
least c,/x integers n € [x,x + 0.1,/x] which satisfy Pt (n) > x!/?*¢, 


Note that the question of finding integers with a large prime factor in a short 
interval has been studied since the work of Ramachandra [7], who proved that there 
is some c > 0 such that [x, x + x!/?~¢] contains an integer divisible by a prime factor 
> x!/2+1/I3 | however, for the present application we need that such integers not only 
exist but in fact are quite frequent. Still, although we could not find this result in the 
literature, the methods we use here are not new, and we will therefore be quite brief. 
The proof relies on the following two results: 


Lemma 3.3 (Vaughan’s Identity) For integers U,V,n with U <n we have 


A(n) = — D7 Amn (d) + DF Hd) logh— | Alm) paz) 


mdr=n hd=n mk=n d|k 
m<U d<V m>U d<V 
d<V k>1 = 


The following follows from Weyl’s estimates: 


Lemma 3.4 There is some a <_ 1, such that for N < x23 we have 
ean gs <« iN 
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Proof (Proof of Lemma 3.2) We estimate the number of integers in the interval 
[x, x + 0.1,/x], which have a prime factor > N by computing the sum 


Yat (+24) - (:]) 


n>N 
= Yo Ad awe a - Yo A) [=] 
n<x+0.1,/x n<x+0.1,/x 
= x+0. aN |- x 
yam(| )) 
= log{x + 0.1 Vx]! — logha]! — }* A(n) (| = [=)) 
n<N 


= 0.1 /xlogx + O(logx) — )* An) (|) = (*]) 


n 
n<N 
Let B(t) = t— [4 - 5 be the saw tooth function. Then we have 


3 Aon ([ 298) - (:]) 


N<n<2N 


=01ve >> AO). 0( y- ain (5)) 


N<n<2N N<n<2N 


+0( S> A(n)B (ao) 


N<n<2N 


= (AE + ot Evol D Ane (= ) 


<n<2N 


+0( S> A(n)B (4) 


N<n<2N 


Approximating B by a trigonometric polynomial and applying Lemmas 3.3 and 3.4 
we see that for N < x?/> the error terms are < N®% for some a < 1. Putting these 
estimates together we obtain 


An ([-)- [=]) = 0.1/xlogx —0.1/xlogN + O(N*). 


n>N 
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Hence, if N < cx!/@%) and c is sufficiently small, the left-hand side of the last 
equation is > 0.05./x log x, and our claim follows. 


4 Proof of Theorem 1.1: The Non-existence of Universal 
Carmichael Numbers 


In this section we prove the first part of Theorem |.1. We begin with the case that n 
is not squarefree. 


Lemma 4.1 A universal elliptic Carmichael number is squarefree. 


Proof Let q be a prime number such that q”|n, and let E be a curve such that ag = 0. 
Then the algebraic number @ defined in Lemma 2.1 equals ./—p. Hence we obtain 


2 way = 2Re(—p)"*/D = 0, vq(n) = 1 (mod 2), 
: +2p%4™/?__ y(n) = 0 (mod 2). 


By multiplicativity we conclude that qg|ay. Since g|n, we obtain that n — a, + 1 is 
not divisible by g. But since q*|n we have that g| exp(E(Z/q’"Z)), hence, from 
Lemma 2.4 we see that 7 is not an elliptic Carmichael number for E. 


Next we consider the case that n is squarefree. 


Lemma 4.2 Let n be a squarefree integer which has two different prime factors 
p.q, and let E be an elliptic curve which satisfies ap(E) = 1, ag(E) = 0. Then n is 
not elliptic Carmichael for the curve E. 


Proof From a,(E) = 0 and multiplicativity we obtain a, = 0. On the other hand, 
dy = | implies that exp(E(Z/pZ)) = p, hence if n is elliptic Carmichael for E, then 
p divides both n and n — a, + 1 = n+ 1, which is impossible. 


Clearly the case a, = 0 is quite special. If n is not the power of a single prime, 
we can give examples of elliptic curves for which n is not E-Carmichael which are 
not supersingular for any prime divisor of n, however, for the prime power case 
we did not find such examples. Unless n has a lot of very small prime divisors, the 
probability for the event a, = 0 is very small, which might leave the impression that 
being E-Carmichael is not a rare event. This impression was the main motivation for 
the remaining parts of Theorem 1.1. 


5 Proof of Theorem 1.1: The Probability for n Fixed 


The proof of the second part of Theorem 1.1 follows a bootstrap strategy. Our aim 
is to show that a potential counterexample has to be divisible by many small prime 
numbers to the first power. To do so we show that if n is a counterexample, then n 
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is not divisible by large primes, and by only few medium sized primes, and that r val 
is small. We shall begin with rather weak results in this direction, and then use the 
results on large prime divisors to strengthen the results on y(n), and vice versa. 


Lemma 5.1 Let n be an integer satisfying y(n) < 2~°” /n. Then the probability 


log!/3 n 
. . . . . -—c 
that n is an elliptic Carmichael number for a random curve E is O(e © %2" ) for 
some c > 0. 


Proof If E is a curve, for which n is an elliptic Carmichael number, en we have 
that exp(E(Z/nZ)) divides n — a, + 1. From Lemma 2.2 we see that oa divides 


exp(E(Z/nZ)), hence sy lt an +1. Clearly 75 my IMs and we have |a,| < 2°” /n, 
thus our assumptions imply a, = 1. By multiplicativity this implies a, = £1 
for all prime divisors p of n. 
From Lemma 2.10 we find that v,(n) « logp log, p for all prime divisors p 
of n. From Lemmas 2.6 and 2.11 we see that the probability that Appin) = +1 is 
v»(n) log p log3 p log? p log? p 
<< ar an <«K ip 
Then the probability for 2 vi) = tl is « e~ Vicen/2 logn, which is sufficiently 
0 


. Suppose that n has a prime divisor po > e Vlog”, 


small. If n has no prime divisor > eV'8" then we have for each prime divisor p of 
n that 


pe” < eO (log? p log, p) < O(log?!” nlogy n) 


1/3 : ; mee ‘ 

thus n has >> Te — different prime divisors. For each of them with a bounded 

number of exceptions we have that the probability for the event Ay) = +1 is 

< 1/2, hence, the probability for the event that Ayn) = +1 for all prime divisors 
log!/ 3h 

of nis < e © 2" , and our claim follows. 


Lemma 5.2 There exists some c > 0, such that for every € > O there exists some 


no such that all integers n > no with y(n) ca - 


n is an elliptic Carmichael number for random curve E is O(log” ' n). 


Proof Assume first that n is divisible by a prime number p, such that p? } n. If n 
is an elliptic Carmichael number for E, then exp(E(Z/nZ)) is divisible by n/y(n), 
hence we have a, = | (mod n/y(n)). By aa this implies a, = +1 


(mod n/y(n)), that is, a, takes on only O(1 + = wo) values. We conclude that the 
probability that n is elliptic Carmichael for a random curve is bounded above by 


log p(loglogp)* —_ logp(log log p)* log? 
sploglogp)” , logplloglogp)”) _ (108 p 


O er Se Ser 
TW ®) ae BP 


). 
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Clearly the conditions a, = +1 (mod —-,) are independent for different primes p, 


ya) 
hence we conclude that if the product of all primes p|n, such that A + n supersedes 


log? n log} n, then the probability for n to be E-Carmichael is < Tog aE 
Next suppose that p > 5 is a prime, such that p?|n. We have 4 | exp(E(Z/nZ)), 


rT 
hence, if 7 is elliptic Carmichael for the curve EF, then 


—|n-a vpn) Aprpiny) + 1, 
n P Pp 
y(n) /P 


Since => nie 7 |n, and Ta oa 
O(1) ways: This in turn implies that a, is determined in O(v,(n)) ways, and we find 
that the probability that n is E-Carmichael for a random curve is bounded above by 


> pe! > p»™/2, we conclude that A») 18 determined in 


v,(n) logplog,p _ lognlogp log, p 
ee eee 
JP Jn 


Hence if p > log’ n logs n, our claim follows as well. 

In particular we find that if n has a prime divisor p > Clog*n logs n, then n is 
E-Carmichael with probability O(log”! n), no matter whether p?|n or not. 

Next suppose that n is divisible by 3 prime numbers pj, pz, p3, such that p; > 
7% logn, and 2 < vy, Ws < 4.PutQ =[]j_.p Bee . Then we have that ag is uniquely 
acne modulo —~ TC , and since 16./0 « TC , we have that ag is determined in 


O(1) ways. If ag is fixed, then a »,,i,a sp)» a y(n) are three integers with product 
Py P3 


dg, thus the number of choices for this triple i is bounded above by 


Ol logQ ) (10822) 
max 13(m) K e229" ~ e Sbhs3"", 
m<Q 


We conclude that the number of choices for the triple (dp, , ap), dp,) is < log* n. 


Each fixed triple can be reached with probability < oe. Hence the probability 
that n is E-Carmichael is bounded above by 
log< n 1 
< —, 
P lo g3/ 2-€ 


and we are done. 
We now bound y(n). We have 


y(n) =| [p 


pin 


=[|r I] elle 


pin pln pin 
prin 2<up(n)<4 vp(n)>5 
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<log’*n |] pl Ip 


P< logn p|n 


< n°! Jog!® n, 


and we are in the range already covered by Lemma 5.1. 


Lemma 5.3 Suppose that n is divisible by a prime number p > n°’. Then the 
probability that n is elliptic Carmichael for a random elliptic curve is O(n~®-””). 


Proof Write n = dp. Then we have e(p — ap + 1)|n— a, + 1, that is, e(p — ay + 
1)|dp — agay + 1. Since e(p — ay + 1) divides p — a, + 1, we obtain 


e(p—ay +1)|n—an+1—d(p—ay+1) = (d—ag)ayp—d+1 < 2(d4 12a) issn no, 


If n — a, + 1 —d(p— ay +1) & 0, we obtain that e(p — ay + 1) < pn, 
which happens with probability < n~°-° in view of Lemma 2.12. If this quantity 
vanishes, then the divisibility property becomes trivial. But then we have a, = =, 
which has at most one solution. Hence the probability for the event that this quantity 
vanishes is O(serrme), which is even smaller. 

The previous lemma will not be used directly, but together with Lemma 5.2 we 


obtain the following, which shall be used repeatedly. 


Lemma 5.4 Suppose that n has not two prime divisors p\,p2, which satisfy pi > 
0.1 logn, and p? + n. Then the probability that n is elliptic Carmichael for a random 
curve E is O(log! n). 

Proof Let n be an integer. If Pt (n) > n°’ or a log* n, our claim follows from 
Lemma 5.3 or Lemma 5.2, respectively. Hence assume that n satisfies none of these 
statements. We claim that for n sufficiently large this already implies that n has two 
prime divisors as described in the lemma. To see this assume the contrary. Let p1, po 
be the two largest prime divisors of n such that Pp? + n. We want to show that these 
divisors exist and satisfy p; > 0.1 logn. Suppose the contrary. Then we have 


< [| Pp: Pt(n)- (= ) = 20-1401) 1087 70-7 top8 py < 9, 
(n) 


eth llogn 


which gives a contradiction. 


Lemma 5.5 Let n be an integer, pj < p2 be prime divisors of n such that p? tn. 
Write n = dp,po. Let E an elliptic curve such that n is elliptic Carmichael for E. 
then one of the following holds true: 


1. dp, is uniquely determined by ag, Gp); 
2. e(p2— dp, + 1) < 4,/P1(p2/p1)"""; 
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3. Putting t = (e(p2 — ap, + 1),n + 1) we have tlagap, and t > a 


Proof Tf nis elliptic Carmichael for E, then 
€(p2 — Ap, + 1) |n— an + 1 = n— gdp, ap, + 1. 


Assume that aq and ap, are given. Put tf = (e(p2 — dp, + 1),n + 1). Then tlaga,,, 
and dp, is uniquely determined modulo g = Apert Th) On the other hand, we have 
lap,| < 2./p, hence, if g > 4,/pji, then we have that a,, is uniquely determined 
in terms of ag, dp,. If g < 4,/pi, we have that t > pi? or e(p2 — dp, + 1) < 
4,/Pi(p2 / pi)!/ ®. Hence in any case one of the statements (1)-(3) has to be true. 


Lemma 5.6 Suppose that n has L different prime divisors < M?. Then n is E- 
Carmichael with probability O(log”! n). 


Proof For 1 < v < M let a, be the number of prime divisors of n in the interval 
[v2, (v + 1)?). If p is a prime divisor of n in [v, (v + 1)”), then by the lower bound 


contained in Lemma 2.6 we have with probability > vieguions that (p—a,+1,n) = 
v. 


Lemma 5.7 The probability that n is elliptic Carmichael for a random curve E and 
at the same time a,(E) = 0 is O(log”! n). 


Proof Suppose that n has two prime divisors p, p2 > el” Then we obtain e( pi- 


apy + 1)|n + 1, and by Lemma 3.1 the probability for this event is < eee < 
logy n 


jog?!” Since these two events are independent, our claim follows in this case. 

The same argument applies if n has > log, n prime divisors > log n, where C is 
a sufficiently large constant. 

Let c > 0 be a constant as in Lemma 3.2. Suppose that n has > logs n prime 
divisors which are > log’ n. If q > 4./p is a prime divisor of p — ay + 1, then g 
does not divide any other number of the form p—a+1, |a| < 2./p. By Lemma 3.2 we 
have that the probability for the event Pt (p—a,+1) > p!/?*¢ is bounded away from 
0, together with the fact that n has at most log n different prime divisors we conclude 
that probability for the event that P* (p—a,+1) > p'/?*¢ and P*(p—a,+1) + n+1 
is bounded away from 0. But in the latter case we either have a, # 0 or that n is not 
E-Carmichael. Since these events are independent for different p, we see that in this 
case our claim holds true as well. 

Now assume that all prime divisors of n are < n°’, at most one prime divisor is 


> els”? ” and at most log, n prime divisors of n are > log© n. We then give a lower 
bound for the product m of all prime divisors p < log“ n of n, such that p* + n. We 
have 


n 


m> = > n°? 
nlogs "(log© n) $82 n 


P+ (njelos””* 
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hence, n has at least 0.1 ao8% prime divisors p < log*/? n, such that p? + n. It then 
logs n 


follows from Lemma 2.9 that the probability for the event that there exists a prime 
divisor p of m such that a, = | is 


c 0.1 log n/ log, n ol log!/4 n I 
> 1 _ (1 = | = 1 —e logs nlog3 n > 1 = : 
log?/* nlog, nlog;n log 'n 


hence, we may assume that there exists some p with a, = 1. But then p—a, + 1 = 
p|n + 1, contradicting p|n, and the proof is complete. 


Therefore it suffices to consider the probability that n is elliptic Carmichael and 
satisfies a,(E) 4 0. 


Lemma 5.8 There exists a constant C such that if n is an integer, which is divisible 
by > log,n prime numbers p > log n, then the probability that n is elliptic 
Carmichael for a random curve E is O(log! n). 


Proof Let p,,..., px be the prime divisors of n which satisfy p; > log© n. We put 
the pair p1,p; into Lemma 5.5. We may assume that C > 7. The probability that 


2 
(1) holds true is << seb eet < log ?n. Using Lemma 2.12 we see that the 


1/3 2: 4 2 
probability that p;, i > 2 satisfies (2) is x 2 Werebem 4 Ws RUE which is 
Po Po 
also < log”? n. We see that the probability that (1) or (2) holds true for at least one 
index i is bounded above by <a 


logn’ 
Taking for P the set of prime divisors of n+ 1 in Lemma 3.1 and using Lemma 2.6 
we see that the probability that p;, i > 2 satisfies (3) is bounded above by 


logn logp;log,p; | log,n 
Se 
logsn —./P logp 


If p > log“ n, the first summand is negligible as soon as C > 2, while the second 
summand becomes < e~! provided that C is sufficiently large. Since the third 
condition of Lemma 5.5 depends only on the second prime, we see that these events 
are independent, and that the probability that each p; satisfies (3) is e~* < + 


— logn’* 


Lemma 5.9 Suppose that n has 5 prime divisors p,,...,pa, such that p; > log* n, 
and po,...,p4 > ev®8", or 3 prime divisors, such that p; > log‘ n, and p>, p3 > 


p°°!. Then the probability that n is elliptic Carmichael for a random curve E is 
< log 'n. 


Proof We argue as in the proof of the previous theorem. The probability that p; 
satisfies (1) or that p;, i > 2 satisfies (2) is < log”! n. In the first case the probability 
log4 n 


J/logn’ 
4 
probability that one specific p; satisfies (3) is << 1082" 


logn 


that one specific p;, i > 2 satisfies (3) is < while in the second case the 


. Since the probabilities for 
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different p; are independent we see that in each case the probability for the event 
that n is elliptic Carmichael is < log”! n. 


Lemma 5.10 There exists ac > 0, such that the following holds true. Suppose that 
n has = logs n prime divisors p Satisfying p > log?-“/? n. Then the probability that 
n is elliptic Carmichael for a random curve E is O(log” ' n). 


Proof Let c be as in Lemma 3.2. We may assume that at least half of the prime 
divisors p of n, which satisfy p > log?-“/ *‘n also satisfies p < log“ n, where 
C is as in Lemma 5.8, for otherwise we can apply the latter. Cut the interval 
[log?-*/? n, log n] into O(1) intervals of the form [x,x'+¢]. Then one of these 
intervals contains >> log3 n prime divisors of n, let [x,x'*¢] be the largest one of 
them. In particular the number of prime divisors p of n with p > x'*¢ is O(log} n). 
For each prime divisor p of n with x < p < x't° we have that p — a + 1 hasa 
prime divisor g > p!/?+¢ > x!/2+¢ with probability >> es > mee The 
probability that this prime is also a prime divisor of n+ 1is << log“ $n, hence with 


probability > 1— (1 — oe" > 1 — O(log”! n) we have that there exists 
2 3 
a prime q > x!/?*¢, which divides some p — a, + 1 for some p|n, but g + n+ 1. 
1 


But then n is Carmichael with probability < aa mar and our claim follows. 


We can now prove part 2 of Theorem 1.1. It follows from Lemmas 5.3, 5.8 


4 : : log s oe ‘ 
and 5.9, that either our claim is true, or n has > Cian prime divisors in the 


interval [c logn, log” “ n]. In the latter case there exists some interval [y, y + y!/?], 
clogn < y < log”©n, which contains > log re? n prime divisors of n. For each 
of these prime divisors the probability that p — a, + 1 happens to be another prime 
divisor of n is > log !/ 2+e/4 ny and for different prime divisors these probabilities 
are independent. We can therefore apply Cernov’s inequality to find that with 
probability > 1 —log™! n we have that there are > log’! 2n prime divisors p,..., Px 
of n, such that for each p; there exists some q;, such that pj = gq; — dg, + 1. 
In particular for each of these prime divisors p; we have that p;|n — a, + 1, 
since p; also divides n, we obtain a, = 1 (mod []p;). Pick three prime divisors 
clogn < r,12,7r3 < log” n of n, which are not among the g;, and such that a tn. If 
such prime numbers do not exist, we discard some of the q; in such a way, that the 
remainder still contains log’? n primes. 

If n is E-Carmichael, then a,,,,,, is uniquely determined modulo [| p; > 8 log’ n, 
hence this product is in fact uniquely determined. If a,,,.,, = 0, then our claim 
follows from Lemma 5.7. Otherwise if a,,,,,, is given, then (a,,,d;,,a,,;) can be 
chosen in 


logan 


c 
73(Grryr3) Ke Wesn 
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log rj logy r 


ways, and for each possible choice is realized with probability < The: Ta 


Hence the probability that n is E-Carmichael is bounded above by 


I log r; log, rj seen & log 3/2+€ n, 
i=1 Vi 


and the proof is complete. 


6 Proof of Theorem 1.1: The Probability for n and E Variable 


The proof of the third part of Theorem 1.1 is similar to the proof of the second 
part, but a lot easier, since we can dispose of integers with strange multiplicative 
behaviour immediately. 


Lemma 6.1 Let n € [x, 2x] and E be chosen at random. Then the probability that 
P+(n) > y and that n is E-Carmichael is O(x¢y~"/7). 


Proof The probability that n is divisible by P*(n)* is O(y—!), hence, we may 
neglect this case. Put p = Pt (n). Then we obtain p—a, + 1|n—a, + 1, subtracting 
Qn|p(P — ap + 1) from the right-hand side we obtain 


DP — 4p + I|n = panjp — Anfp + 1. (6.1) 


If the right-hand side is 0, thenn+1 = (p+ 1)apy/p. In particular a,;, = 1 (mod p), 
thus either a,/, = 1, orp < 2/n/p. In the first case we have n = p, thus n is prime, 
and therefore not Carmichael. In the second case we obtainn + 1 < (p + l)an/p < 
5,/n/p, hence P* (n) < 25. Since the number of integers n € [x, 2x] with P* (n) can 
be bounded by some power of log x, we may neglect this case as well, and assume 
from now on that the right-hand side of (6.1) is non-zero. 


If n — pan/p — An/p + 1 is a non-zero integer, then it has < Pare divisors. 
Hence for p and a,,, fixed, we have that among all possible choices for a, there 
are only x* satisfying (6.1). The probability for hitting one of these choices is < 

—!/2 log px « y~!/?x*, hence our claim follows. 


Lemma 6.2. Fix a squarefree integer d < x, which is not divisible by any prime 
number < log? x. Pick an integer n € [x,2x] and an elliptic curve E at random. 
Then the probability that d\n, (d,n/d) = 1, and that n is E-Carmichael is at most 
O(n 8 t€ 4 x€d-3). 


Proof Fix the curve E modulo d, and put H = ai dZ)). Then we have 
H|n — aganja + 1, thus anja is determined modulo a7 . Since (H, dyja)|n + 1, 


we conclude that a,/q is determined modulo ne ‘robability for the event 


(fant =" 
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d|n and (n+ 1,H) > VHis « mir + 1, if (n+ 1,H) < VH, there are < /7 
choices for a,/q, which could lead to Carmichael numbers. 

We next show that the number of choices for a,x, p*|\|2, which leads to a specific 
anja # Ois < x*. There are < logx prime factors of a,/g, distributing them over 
the possible a,x can be done in at most 


Clogx + Cet \ “reber 

eae 2 ( g aa | a (cles balesies: . 
log log x 

different ways. 

Fixing the values a,x, the number of choices for a, is at most k by Lemma 2.11, 
hence the number of Eo choices for dp, p|n/d given a,« for all p K\n/d is 
bounded above by max ,n2---ne, where the maximum is takes over all positive 
integers satisfying n) +--- +n, = Q(n/d) < a. and £ = w(d) « mis: 
Clearly this maximum is bounded above by 


logx ‘ log x log log log x 
< exp(C——_—————_ ). 
£log2 log log x 


We conclude that for any n,d the probability that H|n — aganjq + 1 subject to the 


condition dnjq # 0 and (H,n + 1) < VH is « “(Fe + {2 

If dyjq = 0, then H|n + 1, which is impossible for (H,d) > 1, and happens with 
probability < a + + otherwise. Summarizing we obtain that the probability for the 
event d|n, H|n — agdnjqa + lis & (oe + +), If we choose n € [x, 2x] at random, 
the probability for the event that d|n, H|n — agdnjqg + 1, and (H,n + 1) < VH is 

€ 1 1 

It remains to show that with high probability we have that H is large. 
Write d = pi---px, and put H; = |E(Z/p,---p;Z)|. Call a prime p; bad, if 
(Hi-1, |E(Z/piZ)|) > a It follows from Lemma 3.1 that either Hj; > x'/*, 


or the probability that p; is bad is bounded by OR) = = O(p; ~7/3). Hence 
1/4 


either H > x’/", or the probability that the product of the bad primes is > y 
is bounded above by x‘y~/?. Hence with probability > 1 — x‘y~?/? we obtain 
H > min(x!/4, (d/y)?/3). Splitting the interval for y dyadically we obtain that the 
probability for the event that n is Carmichael is bounded above by 


: 1 
x* max 


1 1 1 1 
—2/3 a es ee eee oe 
ost (ao adi 7a) “(amt yates) 


Denote by W(x, y) the number of integers n < x, such that P* (n) < y. The following 
is a consequence of Rankin’s trick, see [10, Theorem III.5.2]. 


Lemma 6.3 We have W(x, log? x) = x°/3+00), 
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We can now prove the third part of Theorem 1.1. Consider first the set of integers 
n which contain a divisor D; < d < D2, such that d is squarefree, coprime to n, and 
all prime divisors of d are larger than log? n. Using Lemma 6.2 we obtain that the 
probability that n satisfies this condition and is E-Carmichael for a random curve E 
is bounded above by 


xé >» (d ly Wag 1/2,. W24 gq 4/3) K x M8 + DI? x1? 4 Di), 
D\<d<D2 


If 2 does not possess such a divisor, then either Pt (n) > Dz, or we have J], p"™” > 


n/D,, where the product is taken over all prime divisors p of n, which are < log? n 
or satisfy v,(m) > 2. In the first case we can apply Lemma 6.1 to find that the 


probability that n is E-Carmichael is « x°D, '/2 Tn the second case we can write 
n = abc, where Pt (a) < log? x, b is powerful, and c < D;. Using Lemma 6.3 we 
see that for given c the number of possible choices for ab is < (x/c)*/>**. Summing 
over c we find that the number of possible choices for n is x/ 3tepi/ 3’ Hence the 
probability that a random n satisfies this condition is < (x/D,)~'/3+*. Summing up 
we find that the probability that a random n is E-Carmichael for a random curve E 


is bounded above by 


x(x 8 + Dix? +p," Dee? 4-0, "), 
In a wide range of parameters, e.g. for Dj) = 3/8 D, = x3/*, we have that the 
first term dominates the other terms, hence we conclude that the probability that n 
is E-Carmichael for n and E chosen at random is x~!/8*€. 
Note that the numerical value of the exponent can probably be improved, actually, 
we have no idea what the real value should be. It could well be something like 
—1-+ €, however, this would probably be hard to prove. 
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Integers 
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Abstract This paper provides a survey of results on the greatest prime factor, the 
number of distinct prime factors, the greatest squarefree factor and the greatest m- 
th powerfree part of a block of consecutive integers, both without any assumption 
and under assumption of the abc-conjecture. Finally we prove that the explicit abc- 
conjecture implies the Erdés—Woods conjecture for each k > 3. 


Keywords Block of integers * Greatest prime factor * Greatest squarefree divi- 
sor ¢ Number of prime factors * Powerfree part 
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1 Introduction 


Arithmetic properties of blocks of integers play an important role in various number 
theoretic questions. Let n and k be integers with n > k > 3 and consider N = 
n(n + 1)--- (n+ k—1). Lower bounds on the greatest prime factor and the number 
of distinct prime divisors of N are used in results on Diophantine equations of the 
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form N = y? where y > 1,q > | are integers (see, e.g., [17, 22, 35]) and on 
irreducibility of polynomials (see, e.g., [14, 15, 39]). It is a question of Erdés and 
Graham [13, p. 67], which disjoint blocks of positive integers each of length > 4 
have a perfect power as the product of their terms. The question was answered in the 
negative for more than two blocks by Ulas [45] and Bauer and Bennett [7]. See also 
[8, 28]. It happens for two blocks if and only if the squarefree parts of the products of 
the terms of both blocks are equal. Therefore estimates on the squarefree part of N 
restrict the range of possible solutions. An open conjecture of Erdés and Woods [12] 
states that there is a k such that there are no positive integers nj < n2 such that the 
integers nj +i and nz +i have exactly the same prime divisors fori = 0,1,...,k—1. 
In results on this question the greatest squarefree divisor plays a crucial role. 

In this paper we give a survey of results on these functions in the literature and 
add some new results. Notation is introduced in Sect. 2. In Sect.3 we summarize 
results on the greatest prime factor of N, in Sect. 4 those on the number of distinct 
prime factors on N. Sect. 5 deals with the greatest squarefree divisor of N and Sect. 6 
with the greatest m-th powerfree part of N. Sect. 7 contains results on the problem 
of Erdés and Woods. 

The abc-conjecture provides good insight into the true behaviour of the consid- 
ered functions. In Sect. 8 we mention the results which have been obtained under 
assumption of the abc-conjecture. In Sect. 9 we apply the explicit abc-conjecture to 
the Erd6s—Woods conjecture. Finally we refer in Sect. 10 to generalizations in the 
literature of the results mentioned in this survey. 

New in this paper are a lower bound for the greatest prime factor of N when 
n is very large compared to k, see Theorem 3.1, another approach to prove results 
on the greatest m-th powerfree part of N for given k under the abc-conjecture, a 
new estimate for the greatest prime factor of N for general n and k under the abc- 
conjecture, Theorem 8.2, and the proof that the explicit abc-conjecture implies the 
Erdés—Woods conjecture for each k > 3, Theorem 9.1. 


2 Notation 


We write P(x) for the greatest prime factor of the integer x, w(x) for the number of 
distinct prime factors of x, R(x) for the greatest squarefree factor (the ‘radical’) of 
x and Q,,,(x) for the greatest m-th powerfree part of x. We put P(1) = 1,@(1) = 0, 
R(1) = Q,,(1) = 1. Let 

N=n(n+1)---1+k-1) (2.1) 


and 


P(n,k) = P(N), o(n,k) = @(N), R(n, k) = RIN), Om(n, k) = On(N). (2.2) 
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We write exp, x for exp expx, log, x for log log x, and so on. We use the Vinogradov 
symbols < and > where the attached parameters indicate on which the constants 
involved depend. In a similar way we use the Landau symbols O(-) and o(-). 


3 The Greatest Prime Factor 


Here we only formulate the best available results. For the history and more details 
we refer to [23] and Sect. 1.1 of [38]. 


3.1 Complete Results 


Sylvester proved in 1892 that a product of k consecutive positive integers greater 
than k is divisible by a prime exceeding k. This has been refined by Laishram and 
Shorey [23] to 

P(n,k) > 1.8k forn > k 


unless (n,k) € — {(8,3), (6,4), (7,4), (15, 13), (16, 13), (4,3), (5, 4), (6, 5), 
(9, 8), (12, 11), (14, 13), (15, 14), (19, 18), (64, 63)}, to 


P(n,k) > 1.97k forn > k + 13 
and to 
P(n,k) > 2k forn > max(k + 13, 279k/262). 
Further they calculated all the exceptions to the inequality P(n,k) > 1.95k with 
n > k, but this list is too long to be reproduced here. They noted that the restrictions 
on n, k are necessary. For example, P(279, 262) < 2 x 262. 
Nair and Shorey [30] proved that 
P(n,k) > 4.42k forn > 4k 
and that 
P(n,k) > 4.5k forn > 4k and k 4 82, 83. 
This implies that for n > 100 


P(n,k) > 4.42k 


whenevern,n+1,...,2 +k —1 are all composite. 
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3.2 Asymptotic Results 


If n < k!? and n is sufficiently large, it follows from the results on the difference 
between consecutive primes (see, e.g., [3]) that there is a prime in [n,n + k — 1]. 
Hence P(n, k) > n whenever n is sufficiently large and n < k!? 

For n < k*-* with 0 < ¢ < 0.1 the best result up to now is due to Jia and Liu 
[20], viz. 


P(n, k) >_ KB, 
Harman [19, Chap. 6] proved 
PP, > B®, 


For kK? <n < exp(ci(logk)*/*/(log, k)'/*) for suitable c, the best available 
result is due to Sander [33], viz. 


P(n, k) >> ki te2(osk/ log)? 


for some positive constant co. 
For exp(c (logk)*/?/(log, k)'/?) < n < exp, ((log, k)?/(log, k)*) the best 
available bound is due to Shorey [34], viz. 
log, k 


P(n,k) > klogk ‘ (3.1) 
log, k 


For exp, (log, k)?/log,k)?) < n < exp,(log”k/ log, k) the best available 
bound is due to Ramachandra [31], viz. 


l 1/2 
P(n, k) > klogk (et) (3.2) 
log, k 


For n > exp,(log” k/ log, k) this is superseded by a result of the authors [36]: 
P(n,k) > klog,n. 


Combining the above results we see that (3.1) is valid for n > K3/2. 

According to Langevin [26] the constant involved in (3.2) can be arbitrarily close 
to 1 by letting n tend to oo. Here we shall derive the following sharpening by using 
an improved linear form estimate due to Matveev [29]. The proof is an extension of 
[42, Sect. 10] where the case k = 2 was treated. 
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Theorem 3.1 For k > 2,n > exp, k and n sufficiently large we have 


| 
P(n, k) > klog, n—=3~ 
logan 


Proof Suppose N = pi .--p* with distinct primes p1,..., ps and positive integers 


Then 


ieee (3.3) 


n 


jl j 
Py Pe 
hy hs 


P “+*Ds 


0< 


Set s; = w(n + i) for all i. Note that for each j with 0 < j < k at most w(n + j) + 
@(n+h) < 2s; primes have nonzero exponents. We apply the following special case 
of a linear form estimate. 


Lemma 3.2 (Matveev [29]) Let pi,...,ps be primes. Put 2 = []j_, log p;. Then 
there exists an absolute constant c3 such that the inequalities 


0< Ip? ---p’ —1| < exp(—e*2 log B) 


have no solution in rational integers b,,...,bs of absolute values not exceeding 
B>2. 
This gives 
er | 
a ae — 1] > exp(—e?"'Q log B) (3.4) 
Py eee Ps 


where 2 < (log P)?', B < ae It follows from (3.3) and (3.4) that 


1 1 k 
log ¢ Tost) & grass (log P)*". 
log, n logy n 
This implies 
log,n < s;log, P. 


Without loss of generality we may assume P < (log, n)°. Hence, for n sufficiently 
large, by n > exp, k, 


log 
log 


ee 2 on, 
4n 
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It follows that n + j forj = 0,...,k —1,j # h contains > 082" distinct prime 


logan 
factors > k. Since two such numbers n + j cannot both be divisible by the same 


prime > k, we find 


1 
w(n,k) > k 082" 


log,n 
Thus, by the Prime Number Theorem, 


| 
P(n, k) > klog, n— 3" 
log,n 


4 The Number of Distinct Prime Factors 

4.1 Complete Results 

For more details we refer to [22]. Sylvester’s earlier mentioned result implies that 
o(n,k) > (k) 


ifn > k. Let 6(k) = 2 for3 < k < 6,= 1 for7 < k < 16, and = 0 otherwise. 
Laishram and Shorey [23] improved upon earlier results by specifying all the finitely 
many exceptions to the inequality 


c(n, k) > w(k) + Ea —14+ 68). 
As corollaries they obtained 
If (nk) ¢ {(114, 109), (114, 113)}, then w(n,k) > 2(k) + Ea = 
and 
o(n, k) > min («w + Ea — 1+ 8(k), x(2k) — i) 
They further proved 


If (n,k) 4 (6, 4) and 12n > 17k, then w(n, k) > 2(2k). 


Again this is best possible, in view of w(6, 4) = 2(8)—1 and w(34, 24) = 2(48)—1. 
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4.2 Asymptotic Results 


Ramachandra et al. [32] proved that there exists a constant cy > O such that for 
n > exp(ca(log k)), 


w(n,k) > k. 


Shorey and Tijdeman [37, Theorem 5] showed that there exists a constant cs; > 0 
such that for k > c5,n > exp,(csk), 


w(n,k) > k + w(k) — 2. 


5 The Greatest Squarefree Divisor 


There are hardly estimates on R(n, k) in the literature, but the best approach seems 
to be the following. Combine the results of the previous section with estimates 
on the product of the first primes. For the complete results an explicit estimate is 
required, e.g. 

the product of the primes < r is less than 3’, see [18]. 

For the asymptotic results the best possible result reads 

the product of the primes < r is e1F°O)", 

which follows from the proof of the Prime Number Theorem. It follows that the 
product of the first ¢ primes is ¢¢+°)", Thus, as an example, from the first result of 
Sect. 4.2, we have for n > exp(ca(log k)), 


R(n,k) > fl rode 


6 The m-th Powerfree Part 


Khodzhaev [21] answered a question of Sprindzhuk by showing that there exists a 
positive constant cg such that there are only finitely many pairs of integers (n, k) 
with k < (logn) for which 


Qo(n,k) < kK. (6.1) 
More precisely, he showed that there exist positive constants c7, cg such that (6.1) 
does not hold when | < k < c7.,/n, but (6.1) holds for all k > cg /n. 


Sprindzuk [41] and Turk [44] gave bounds for the greatest m-free part of a 
polynomial f(x) € Z[x] at integers x. These bounds imply, for k > 3, 


Q>(n,k) Sex (logn)B~. 
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(cf. [41, p. 161], formula (2.10)) and for k > 3,m > 3, 
n(n, K) > 5 (log n)/Gn"oem), 


where cy > 0 depends only on k and ¢$(m) is the Euler function (cf. [41, p. 139], 
formula (1.2)). 

It follows from a result of Turk [44] on polynomial values that for k > 3,m => 2, 
n> 20 


P(On(n, k)) >k log, n 


and fork = 2,m > 3, 
P(Q,,(n, k)) > log3n. 


de Weger and van de Woestijne [9] considered 


sae 


They proved that for k > 2,m > 3 andalln e N 
Am(n, k) > km Cogny)/C"—), (6.2) 
On the other hand, they showed that there are infinitely many pairs k, m such that 
Am(A, K) Kem 0D, 


If k and m are odd, the exponent may be replaced by 1 — m/(km — 2). In Sect. 8 we 
argue that it is likely that the last two results are close to the best possible ones. 

In order to obtain a lower bound for Q,,(n, k) from (6.2) we first remark that (6.2) 
implies 


k-1 


IT On(n, k) >km (log n)/ On) 
i=0 


and then have to correct for the common factors. By an argument of Erdos [11] it 
suffices to divide by 


kyy ye 
ere <a (6.3) 


p<k 
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which depends only on k. It follows from (6.2) and (6.3) that 
Onn, k) > km (log n)X/Om—D 


The exponent of log is much better than in the earlier mentioned results of 
Sprindzuk. 


7 The Erdés—Woods Conjecture 


Erdés [12] conjectured that if m,n are positive integers there exists a k such that if 
for | <i < k the numbers m+ i andn +i have the same prime factors, then m = n. 
This problem is equivalent with a problem of J. Robinson in logic (Woods [46]). 
The only known examples of positive integers (m,n, k) with 1 < m<n,k > 2 and 


R(m + i) = R(n+ i) for0 <i<k (7.1) 

arise from 
R(2" — 2) = R(2"(2" — 2)), R(2” — 1) = R((2" — 1)”) (h= 2) 
and 
R(75) = R215) = 15, R(76) = R(1216) = 38. 
It is proved in [4] that if (7.1) holds then 
logk < (logmlog, m)'/? for k > 3, 
log(n — m) > k(logk)?/ log, k for k > 3, 


and 


log(n — m) > k(logk + log; n) log, y(log; y)~! for k > 27. 


8 Results Under the abc-Conjecture 


The well-known abc-conjecture reads as follows. 


Conjecture 1 For any given ¢ > 0 and coprime positive integers a, b, c satisfying 
a+b=cwe have 


c <, R(abc)'*®. 


In this section we give applications of this conjecture. 
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8.1 Estimates for Fixed k 


Langevin [27] (see also Granville [16]) proved that if the abc-conjecture is true 


and g(x) € Z|x] has no repeated roots, then for any ¢ > 0 we have Me a(n) P > 
|n|*2'9)—!—©. Tt follows from this result that under the abc-conjecture 
R(n, k) >« |n|AT*. (8.1) 


de Weger and van de Woestijne [9] showed that for any ¢ > 0,k > 2,m > 2 
under the abc-conjecture there are only finitely many integers n for which 


Am(n, k) < ni me (8.2) 


This is a very good counterpart to their unconditional results with exponent 1 — 
m/(km — 1) and 1 — m/(km — 2) mentioned in Sect. 6. It follows from (8.2) that 


k-1 


[]Qmlr + i) > nk were, 


i=0 


In order to obtain a lower bound for Q,,(n, k) we need to correct for the common 
factors. By the argument of Erdés (6.3) the common factor depends only on k. Thus, 
for any ¢ > 0, 


O(n, k) > kame WT, (8.3) 


Note that, apart from e, both (8.1) and (8.3) for k = 2 are the best possible. 
For (8.1) it suffices to observe that n can be a power of 2 and the other terms are at 
most n + k, for (8.3) that x = n+ 1,y = ncan be a solution of the Pell equation 
x —2y? = 1. 

We shall prove (8.1), (8.3) and a related result by a new method based on the 
following lemma which may have some independent interest. 


Lemma 8.1 Let k be a positive integer. Let x be a positive real variable. Then 


k k 
T] @+99- [TT] @4+90 == n+ O68), 


even i=0 odd i=0 


Proof The following formula holds for Stirling numbers of the second kind: 


k 
1 k-i k n 
Sik — Kl eae ('): f 
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We have S(n,n) = 1 and S(n,k) = 0 fork > n (see, e.g., [1, pp. 824-825]). It 
follows that, as x > 0, 


(k E dav gil 
seni) log(1 + ix) = reo(') o's = 
i=0 i=0 j=l 


CO ¢_4)\k-j-1,j _* 
3 — reo ()} i = —(k— bk + One). 
j=l i=0 


Hence 


[se, i=0 el + ix)() 


Ii dd eg (A+ ix)() —1=—(k- Db* + Oe). 


Since Yio) (4) = (1 —1)* = 0, we obtain, for x > oo, 


10% even, i=0 (x + i) () 


TE ao —1=-(k-1)le*¥ + 0O,@* ). 
i odd, i=o 1 YN 


Thus, using that SF jag. j= (“) = 2* 1, 


L 


k k 
[] @+ iy) — I] @+ NO = (hI) e* + OOTY), 


i even, i=0 i odd, i=0 


Theorem 8.2 Let k,m and n be integers withn > k > 2,m > 2. Then, assuming 
the abc-conjecture, for every € > 0 we have 


(a) On(n, k) > «km nk-l-wI- 8, 
(b) R(n,k) Dex 1, 
(c) P(n,k) = (k— 1+ og(1)) logn. 


Proof (a) We apply the abc-conjecture to 
1 k-1 = 1 k-1 wv 
PY o k—2_ 
> I] (tar) =— I] (n+ ii) = Ox(n? kL), 
even i=0 odd i=0 


where g is the greatest common divisor of the first two terms. By the argu- 
ment (6.3) of Erdés, g = O;,(1). Hence, by AB” = N <j n*, 


no = Ite a = I+e 
ne Kes (R(AB) -n™ —) ae (A ere eet) (8.4) 
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It follows that 


—¢_m_4k—2 
1? m1 2 2 


A > «km nk 
(b) It follows from (8.4) that 
gk—2 


R(AB) >ex nk '* 


(c) Since the product of the first / primes is e+°)”, we deduce from part (b) that 
P(n,k) => (k-— 1+ 0;(1)) logan. 


8.2 Estimates for General n,k 
8.2.1 Greatest Prime Factor 


Theorem 8.3 Let 0 < ¢ < 1/2 andn > k3/2. Then under the abc-conjecture, there 
exists a number k, depending only on € such that for k = k, we have 


1 
P(n,k) = G — e)klogn. 


Proof Let k => k,. Assume that 


P(n,k) < (; - :) klogn. (8.5) 


Then, for a suitable constant cio, by the Prime Number Theorem with remainder 
term, 


1 E 


(5 — e)klogn cioklogn a= 5)klogn 


on, k) = —— (8.6) 
logk+log,n (logk+log,n)? ~ logk+ log, n 


For 0 <i < k, write 

n+i= A;B i 
with P(A;) < k and gcd(B;, Thee p) = 1. For every prime p < k, let i, with 0 < i, < 
k be such that p appears to the maximum power in A,,. Let S; be the set obtained 


from {0, 1,...,k—1} by deleting i, for each prime p < k. Then |S;| > k—(k). By 
the combinatorial argument (6.3) of Erdés, 


TL 4is&. (8.7) 


AjES 
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Suppose that there exist at most (1 — 5)k number of i’s in S, such that 
Aj < BV, 


Then there are at least ek/8 number of i’s with Aj > k°/*. Consequently, from (8.7) 
we get 


KE as | [ai > (Gey = KK, 


i€S; 
a contradiction. Thus there exists a set Sy C S; with |S| > (1 — 4)k and 
Ai < K/* for ie Sp. 
Assume that there is at most one B; with i € Sz such that 


(4 - 3) logn 


Oh) Ss 
logk + log, n 


Then 


(5 — S)klogn é 
kb B) = 2A ((1- k= 1). 
an = Liat )2 logk + log, n ( 7 ) 


From (8.6) we get 


5 - LE)klogn - 5 — $)klogn 


logk + log,n ~ logk + log,n™ 


Thus there exist at least two B;’s, say B;, and B;,, with i), i2 € Sz, i; > in and 


(4 _ a) logn 


B;. < ———__,, 
o(Bi) s logk + log, n 


= 1,2. 


By (8.5) we get 


(4—§) logn l_i¢ 
~j< logktloggn << 2 3 
R(B;) < (klogn) Perea" < 1273, 


Applying the abc-conjecture to 


n+i =n+i4+ (ht —ho) 
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we get 


n< cy (R((n + i1)(n + i2))k)t? < cr (R(Ai Bi AnBn)k)'*? 


i —2é £ i € —é 
<en(ken® DME <eykeOtDnl 8, 


where c,; is a constant depending only on ¢. Therefore, 


6 102 
logn < = logen + (1+ =) logk < cy logk (8.8) 
€ & 2 


where cj2 depends only on ¢. By (3.1) and (8.8) we get 


log, k T 
P(n,k) > klogk 222 »> klogn 222". 
log, k log,n 


Hence the assertion follows. 


9 Proof of the Erdés—Woods Conjecture Under the Explicit 
abc-Conjecture 


As a special case of a result on the analogue of the Erdés—Woods conjecture 
for arithmetic progressions it was proved in [5] that under assumption of the 
abc-conjecture the original Erdés—Woods conjecture admits only finitely many 
exceptions for every k > 2. (See [5, Proposition 1].) For k = 2 there exist 
infinitely many solutions: for every integer h > 2 the pairs (2 — 2,2'(2" — 2)) 
and (2” — 1, (2" — 1)?) have the same prime divisors. 

An explicit version of the abc-conjecture is the following. 


Conjecture 2 (Baker, [2]) Let a,b, and c be pairwise coprime integers satisfying 
a+b=c. Then 


6 (log R(abc))?@? 


Laishram and Shorey [25] proved that a consequence of (9.1) is that 
c < (R(abc))"4. (9.2) 


Here we prove the complete Erdés—Woods conjecture for k > 3 under assump- 
tion of the explicit abc-conjecture. Of course, it suffices to prove it for k = 3. 
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Theorem 9.1 Assume Conjecture 2. Then there are no positive integers ny < no 
such that for i = 0,1, 2 the numbers n, +i and nz +i have the same prime divisors. 


Proof Apply Conjecture 2 with (9.2) in place of (9.1) to the equation (n2 + 1)? — 
n2(nz + 2) = 1. This yields 


ny < (R(n2(nz + 1)(m2 + 2)))'”. 


Observe that every prime p dividing n2(nz + 1)(m2 + 2) divides either ny — n, or 
(nz + 1) — (mn, + 1) or (m2 + 2) — (nm + 2). Thus 


R(n2(nz + 1)(n2 + 2)) < ng — 1 < np. 


Combining the inequalities we obtain 


2 1.75 
Ny < ny”, 


a contradiction. 


10 Generalizations 


More general results have been obtained for the greatest prime factor of the product 
of the terms of a finite arithmetical progression (see e.g. [24, 36]) and for the greatest 
prime factor of the value of a polynomial or binary form (see, e.g., [40]), for the 
number of distinct prime divisors of the product of the terms of a finite arithmetical 
progression (see, e.g., [22]) and of a polynomial value (see, e.g., [43]), for the 
greatest m-free part of a polynomial value (see [44]) and for the analogue of the 
Erdés—Woods conjecture for an arithmetical progression (see [5]). 

There are also more general results under the assumption that the abc-conjecture 
is true: for lower bounds for the greatest squarefree divisor of a polynomial value 
and a binary form value, see [10, 16, 27], for a lower bound for the greatest m-free 
part of a binary form value, see [27], and for the Erdds—Woods conjecture for an 
arithmetical progression, see [5]. 

Finally, for an estimate for the number of prime divisors of a product of consec- 
utive integers under the assumption that Schinzel’s Hypothesis H is true, see [6]. 
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The GCD of the Shifted Fibonacci Sequence 


Jiirgen Spilker 


Dedicated to my friend Wolfgang Schwarz 


Abstract Let f(n) be the Fibonacci-sequence defined by f(n+2) = f(n)+f(n+1), 
f(0) = 0,fC) = 1, and let t,(n) := ged(f(n)+s,f(n+1)+5s), s integer. In 2011 K.- 
W. Chen has proved that the function ¢,(7) is bounded if s exceeds 1. THEOREM: 
Let n and s be integers. (1) t,(n) divides s* + (—1)"; (2) ifm := s* —1 is not 0, 
then t,(n) is simply periodic; a period p is defined by f(p) = 0 mod m, f(p+ 1) = 
0 mod m. There are explicit formulas of t,(m) and generalisations to a wider class of 
recursive second-order sequences. 


Keywords Fibonacci sequence * Recurrences 


2010 Mathematics subject classification: Primary 11B39; Secondary 11B37 


1 Introduction 


The Fibonacci sequence f : No + No is defined by the recursion 
f(nt2)=f(n)+faatl), nENo (1.1) 


and the initial conditions f(0) = 0,f(1) = 1. Any two successive terms of the 
sequence are coprime, since by (1.1) every common divisor d > 0 of f(n + 1) and 
f(n + 2) divides f(n), then f(n — 1),...,f(1), sod = 1. 

In this note we consider the shifted sequence and define the greatest common 
divisor (gcd) 


t(n):= (f()+s,ftn4+ I 4+s5), neNo,se€Z. 
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The function 1 +> f9(n) is constant. The function n +> t;(n) is unbounded, since 
(f(4n—1)+ 1 f(4n) +1) =fQn-1=n, neN 

(in Lemma 4.1 replace n by 4n — 1 and specialize g(n) = f(n),m = —2,k = n). 

All other gcd-functions ¢,(7) with |s| > 1 “behave between these extremes”: they 

are bounded [1]. 


In this article we will show that they are in fact simply periodic. This result will 
be generalized to a class of functions f and general shifts 


ne’ (f(y+s,fntr)+, ms,t€Z,r>0. 
(Theorems 2.1 and 2.6). It is astonishing that these multiplicatively defined func- 
tions have such an additive property. 
If f(n) is the Fibonacci sequence, in Theorem 2.2 we give formulas for f¢,(n), 
and compare right- and left-shifts in Theorem 2.4. The theorems are in Sect. 2, the 


proofs in Sects. 3-8; they are elementary. 
The gcd of three shifted Fibonacci numbers is trivial: 


(fm) +s,f(nt+1)+s,fM@+2)+s)=1, n,seZ 
since every divisor d € N of these three integers divides 
fat+)+s—-G¢@+5) =fa-) 
and 
fm+2)+s—GFa@t+1 +s) =f, 


hence it divides (f(m — 1), f(m)) = landd = 1. 


2 The Theorems 


The periodicity of the gcd-function f,(m) is not only valid for the Fibonacci 
sequence, but also for a wider class of recursively defined functions. 


Theorem 2.1 Let b,a, B,s € Z, y := B? — a? —aBb and f : No > Z defined by 
fat DV =f—+hfO+, £0) =a,fC) =B. (2.1) 
Suppose that no integern € No satisfies f(n) = f(n + 1) = —s. Then 


t(n):= (fn+s,fr~+1+s), neENo, 
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is defined and we have 


(a) t,(n) divides bs? + (—1)"y,n € No; 
(b) ifm := \b’s* — y?| # 0, then the function n + t,(n) is simply periodic, i.e., 
there exists p € N so that t,(n + p) = t;(n),n € No. 


A period p < m’ can be chosen by 
Ff (p) = 0 mod m,f(p + 1) = 1 mod m. 


One can obtain formulas for ¢,(7). 
Theorem 2.2 Let f(n) be the Fibonacci sequence, s € Z and |s| > 1,k €N. 
(a) Iff(2k — i) 


a; mod (s* + 1),0 < a <s* + 1,i€ {0,1}, then 


t,(4k) => (a + sa, SAo —ay,s? + 1), 
t,(4k — 2) = (ap + (s — Lay, soy — (s + 1), 5* + 1). 


(b) If f(2k — i) = B; mod (s* — 1),0 < Bi < s? — 1,1 € {0, 1}, then 


t,(4k — 1) = ((s + 1)B1, (s — 1) Bo — sfi, s? — 1), 
ts(4k — 3) = (Bo — (s + 2)B1, (8 — 1)Bo — (s — I) B1.87 — 1). 


Periodical functions have a mean-value M. The next theorem contains two formulas 
for M. 


Theorem 2.3 Let p € N andt : No > Z satisfies tn + p) = t(n),n € No. 
Moreover, let q > 0 be the least common multiple of t(0),t(1),...,t(p — 1). Then 
M(t) := ; Pee t(n) satisfies 


M(t) = *y dh@, where I\(d) := #0 <n <p—1:t(n) =d} 
d\q 
and 
1 
M(t) ==) > p@h(d), where ln(d) := HO <n < p—1:d| t(n)}; 
d\q 


here y(n) is the Euler function. 


Example Let f(1) be the Fibonacci sequence and s € {—2,2}. Theorem 2.1 gives 
t,(n) € {1,3,5},n € No. Ina table of the first Fibonacci numbers one finds 


f(40) =0Omod 15, f(41) =1 mod 15. 
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So t;(n + 40) = t,(n), if n > 0, and with the help of Theorem 2.2, 


5 ifn = 6 mod 20 5 ifn = 16 mod 20 
h(n)= 43 ifn =1mod8 to(n)= 43 ifn=5mod8 
1 else, 1 else. 


In Theorem 2.3, take p = 40, g = 15. Then for tf = f andt = t_p, 


#215 aais 
2 .d=5 a= 

ia Ree 

= ge : ae 
6. 71 40° a= 


and M(t2) = M(t-2) = 2. 


In this example the gcd-functions f)(n) and t_2(n) have the same values on a period 
interval inclusive their multiplicity and they attain the upper bound s?+1 on the even 
numbers, resp. s? — 1 on the odd integers. This has the following generalization. 


Theorem 2.4 Let f(n) be the Fibonacci sequence, s € Z,|s| > 1 and p € 2N 
a period of t;(n). Then the sequences t;(2),t;(4),...,ts(p) and t-s(2), t-s(4), 
...,t-s(p) are permutations of each other. 


Remark The restriction to even integers is necessary, since t3(7) = 8, but t_3(n) 4 
8 for every n. 


Open Question We have M(t.) = M(t-2) = 2,M(m) = #,M(t-3) = &. For 
which s > 3 is M(t.) = M(t_s)? 


For every m € N there exists ani € N such that m | f(i) [2, Theorem 1]. Then 
e(m) := min{i € N: m | f(0} is called the entry point of m. 


Theorem 2.5 Let f(n) be the Fibonacci sequence and s € Z, |s| > 2. 


(a) The function n + t,(n) has the upper bound m = s* +1 as a value, iff the entry 
point e = e(m) is odd and s = +f(e + 1) mod m. 

(b) t,(n) has valuem = s* —1 on the odd integers iff s = 2, ore evenand s = 
—f(e + 1) mod m. 


Open Question Let f(n) be a solution of the general recursion 
fnt2)=af(n)+bf(n+1), abdeZ. 


Tf |a| > 1, is t.(n) still periodic? 


The periodicity is also valid for a more general gcd-function. 
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Theorem 2.6 Let f(n) be the Fibonacci sequence andr € N,s,t € Z,c := 
(—1)"*!s? + (f(r + 1) — f(r — 1))st — ?. Assume that f(n) = —s,f(n +r) = -t 
holds for no n € No, so that the function 


T(n) := (SA) +5,frt+r)+t), neNo 


is defined. Then 
(a) T(n) is a divisor of c + (—1)"f?(r) for alln € No. 
(b) If \c| 4 f?(”), the function n +> T(n) is simply periodic. A period 1 < p < m 
is found by 
S(p) =Omodm, f(p+1) =1modm, 


where m := |c? — f*(r)|. 


In the special case t = —s, there are formulas for 


ts—s = (f(r) + 5, f(n + I) — 5). 


Theorem 2.7 Let f(n) be the Fibonacci sequence, s € Z,|s| > 1,k EN. 
(a) Let f (2k + i) = a; mod (s* + 1),0 < aj < s* + 1,i € {1,2}. Then 


t,—s(4k + 1) = (a; + saz, soy — 2, 57 + 1), 
t, (4k + 3) = (ay + (8 + lata, sory + (s — 1)az, s* + 1). 


(b) Let f (2k + i) = B; mod (s? —1),0 < B; < s? —1,i€ {1,2}. Then 


ts-s(4k) = ((s + 1)B2 — Bi, (s— DBi.s8? — D, 
ts,—-s(4k + 2) = ((s + 1)B2, (s — 1)Bi — Bo, s* — 1). 


The proof is similar to that of Theorem 2.2. 


3 Proof of Theorem 2.1 


We use the matrices 


af Ol (fat) fa 
M:= (75) and Fay =( f(a) Fine 
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For everyn €N, 
F(n) = MF(n—1) =... = M""'F(1). 
Going over to determinants 
fat Df@-)-P?@ = (-D""FQFO) —f 0) = 1)"r. 
Let d € N be acommon divisor of f(n) + 5 and f(n + 1) + s. Then 


fat DFO 1 —bfa) -— PM) 


= —bs* mod d. 


fat Df(a-)-fP@) 


II 


Consequently d is a divisor of bs” + (—1)"y. This proves assertion a. 
The vector-valued mapping 


f(n) mod m 
wv i + 1) mod ») 


has only finite-many values. So there are integers p;, p2 such that 0 < p, < py < m 


and 


f(p:) =f(p2) modm, = f(pi1 +1) =f(p2 + LD mod m. 


By (2.1) we get f(p; + n) = f(p2 +n) mod m for every n > 0 and together with 
Pp := P2—p. <m’ we have 


f(n+p) =f () modm foralln € No. 
Since by assertion a t,(n) divides m, for every n € No, we finally get 


t(n) = (f(r) +s, f(n + I) + s,m) 
= (ftp) +s fat+1+p) + s,m) =t.(n+ p). 


4 Proof of Theorem 2.2 


The solutions f : No — Z of (1.1) can be continued in exactly one way to functions 
f:Z— Z still with property (1.1). If f(”) is the Fibonacci sequence, then 


f(-n) = (-1)"*'f@),  nENo. (4.1) 
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Lemma 4.1 If f,g : Z — Z are solutions of (1.1) and m,n,k € Z, then 
(Ff) + gmt I) f(n+ 1) —g(m)) = (f(n—2k) + (m+ 2k+ 1), f(n— 2k + 1)— g(m + 2k). 
Proof We use the following rules of the gcd 
(a,b) = (b,a) = (—a,b), (a,b) = (a + bc, b). 
Let 
t(n,m) := (f(n) + g(m + 1),f(n + 1) — g(m)). 
Then by (1.1) 
t(n,m) = (f(n) + g(m + 1), f(n— 1) — g(m + 2)) 
= (f(n— 2) + g(m + 3), f(a— 1) — g(m + 2)) = t(n—2,m + 2). 
Iterating this we get the assertion f(n,m) = t(n — 2k, m + 2k). 
In Lemma 4.1 set g(n) = sf(n),m = —2. Then 
t,(n) = (f(n — 2k) + sf (2k — 1), f(n— 2k + 1) — sf (2k —2)). 
First we consider the case n = 0 mod 4, say n = 4k. Then 
ts(4k) = (f(2k) + sf (2k — 1), f(2k + 1) — sf (2k — 2)) 
= (f(2k) + sf(2k — 1), —(s — I)f(2K) + (s + DF (2k — 1) 
= (f(2k) + sf(2k — 1), f(2k — 1) — sf (2k). 


Set a; := f(2k—i),i € {0, 1}. Then t,(4k) is a divisor of ap + sa; and a; — sao, hence 
of (ay + sa) —s(a, —say) = (s? + 1)apo and of s(ap + sa\) + (a, —Sao) = (s?+1)a. 
Since do and a, are coprime, f;(4k) divides s? + 1 (this is also a consequence of 
Theorem 2.1) and 


ts(4k) = (ao + say, a1 — Sao, s” + 1) 


= (a + sa), a) — S0t, S* + 1). 


This proves the assertion in the case n = 4k. The other cases n # 0 mod 4 are 
proved in the same way. 
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5 Proof of Theorem 2.3 


We have 


pM(t) = D> «n= 1d D2 1] => did) 


O<n<p d\q sees d\q 


and further 


ll 
M 
M 
M 


PM) = Yi 0M.g= >> }r YO 


0<n<, , O<n<p O<n<p 1 
of Ne \ Gage ria win) (42.4) 
~ ~ ~ d 
=| Yeo Y=] Vvot- 
riq 1 4 O0<n<p d\q Id 0<n<p 


Ir|t(n) |t(n) 


Since g(d) = > lld u()4, this is the second formula. 


6 Proof of Theorem 2.4 


If n € N is an even integer, then with (4.1), 


t(n) = (f(y) +5, f(n+ 1) +s—f(n) —s) = (f@) +5,f(n— 1) 
= (f(n-2) +5,f(m—1)) 
= (f@—2) +5,fla— 1) —f(n— 2) —s) = F(n— 2) + 5, f(a — 3) — 5) 
=(yenrt 2) +s frst 2)—s) 
= (f(—n + 2) —s,f(—n + 3) —s) = t_s(—n+ 2) 
= t-s(-n+2+4p). 


The mapping zm : nt —n+2+4 pis a bijection of the set {2,4,6,...,p} and 
ts = t_s o w. This proves the theorem. 
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7 Proof of Theorem 2.5 
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With the help of the rule f(m + n) = f(m + 1)f(n) + f(m)f(n — 1) we find that the 


integers f(1),f(2),... are modulo m equivalent to 


f(), F(2),--55 f(e— 1),0, 
f(e+ DFG), fle + DFQ),..-, fle + If(e— 1),0, 


Pet DFA) f7(e + DFQ),....f(e + If(e—1),0,... 


It follows that 


f(ke) =Omodm, f(ke+1)=f*(e+1)modm (k€N). (7.1) 
Next the congruences 
f(e+ ) =f) +fle- 1) =fle—1) mod m 
fle+ Dfle-)=f(e)+ (CI) =(-1)* modm 
give 
f-(e+ 1) = (-1)° mod m. (7.2) 
(a) Let t,(no) = m. Since t,(n) = (f(n) + s,f(n— 1)), we have 
f(n) +s =Omodm, f(no — 1) = 0 mod m. (7.3) 


By [2, Theorem 2.3], no — 1 is a multiple of e, say no = ke + 1 (k = O). 
Now nv is an even integer, since otherwise t,(ny) = s* + 1 would divide s* — 1 
(Theorem 2.1). Since k and e are odd, we get by (7.1) and (7.2) 


f(mo) = flke + 1) = f*(e + 1) = (-1)'2 fe + 1) mod m 


and s = f(e + 1) modm or s = —f(e + 1) mod m. Conversely, if s = f(e + 
1) mod mand ¢ is odd, then np := 3e+1 is by (7.1) and (7.2), a solution of (7.3), 
hence m divides t,(n9). By Theorem 2.1, t,(mo) divides m, so t,(mo) = m. If 
s = —f(e+ 1) modm, then np := e + 1 solves (7.3) and t,(n9) = m. This 
proves part a. 


(b) In an analogous manner one shows part b. 
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8 Proof of Theorem 2.6 


Let 


— (f@tl) fa _ ptt 
Fa = ( f(n) 7) und m= FW)= (1): 


Then 
F(n) = MF(n— 1) = M’F(n—2) =... = M" 'F(1) =M". 
Taking determinants gives 
fat Df@-)-fPa) =)", ne No. 
Further 


F(r+n) = M7" = MM = F(r)F(n). 


Comparing the 1-2-elements in the matrices gives 


fat ry =fr+ Di) +f~fa— I). 
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(8.1) 


(8.2) 


Let d € N be a common divisor of f(m) + s and f(n + r) + t. Then we have 


mod d 


s=—f(in), t=-frtr), st=fa@fnt+r). 


Together with (8.1) and (8.2) we compute mod d 


(r+) +f(r—1))st-—? 

=(f0 +) +fC-If@ -fa+nifatr 

= (FC —- DF —fOF@- DFC + DF + f~Fa— D) 

=f(r— DF + DPM —FMF@— DFWF(— 1) + 
Hf + D-fr— Df) 

=f(r— DET + DPM) —P O(a — Df(a+ YD 

=(PO+Y0VPO -POW?@ + -D") 

= (“D's -(-)'P?'), néENo 
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and finally 
c+ (-1)"f?(r) = 0 mod d. 


The second part of the theorem is proved like Theorem 2.2b. 
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On Liouville Numbers: Yet Another Application 
of Functional Analysis to Number Theory 


Jérn Steuding 


Wie dem Geiste nichts zu grof ist, so ist der Giite nichts zu klein. 
—Jean Paul 


Abstract In 1962, Erdés proved that every real number can be represented as a sum 
and as a product of two Liouville numbers. This has been generalized by Rieger and 
Schwarz. In this note we shall give an analysis of these results and their proofs. 
Moreover, we consider a certain subclass of Liouville numbers and prove similar 
results for this subclass. Since Wolfgang Schwarz had been very much interested in 
the history of mathematics, and the author shares this interest, he could not resist 
to include a few historical remarks (and footnotes) on transcendental numbers and 
Baire’s category theorem which might be interesting for the reader. 


Keywords Baire’s theorem * Liouville numbers 
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1 A Brief History of Transcendental Numbers 


In mathematics, the notion of transcendence dates at least back to Gottfried Leibniz. 
According to Eberhard Knobloch [50], Leibniz had used the term transcendental 
since 1673 for different mathematical objects; in particular, the failure of algebraic 
methods to solve polynomial equations of degree five or higher led him to investigate 
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the infinite by his differential calculus.' It was probably Leonhard Euler [29] in 
1748 who was the first to provide a proper definition of a transcendental number. 
In modern terminology, a number & is called transcendental if it is not algebraic. 
Here a number & is said to be algebraic if there exists a not identically vanishing 
polynomial with integer coefficients having € as zero; the minimal degree of such a 
polynomial is said to be the degree of &. In 1844, Joseph Liouville [54-56] proved 
his famous approximation theorem for algebraic numbers: for any algebraic number 
E of degree d > 1, there exists a positive constant c, depending only on &, such that 


> —_— 


P 
—&— — 
| q| 4 


for all rationals 2 with q > 0.* Liouville’s result has been significantly improved 
by Axel Thue, Freeman Dyson, Carl Ludwig Siegel, and, finally, Klaus Roth who 
succeeded in 1955 in showing that for every algebraic & and every € > 0, there exist 
only finitely many rational solutions : to the inequality 


1 


= gre 7 


I 
q 


independent of the degree of &. For this and further results of this flavour we refer 
to Hindry and Silverman [43]. 

Consequently, algebraic numbers cannot be approximated too good by rational 
numbers; moreover, numbers which allow better rational approximations are tran- 
scendental. This idea had been the motivation for Liouville for his studies on this 
topic in the period from 1840 to 1851; his main attempt to prove the transcendence 
of a or e := exp(1) failed, however, his approximation theorem had definitely 
influenced his protégé Charles Hermite [39] for his proof of the transcendence of e 
in 1873. In fact Hermite wrote? 


Dans cet ordre d’idées M. Liouville a déja obtenu un théoréme remarquable qui est I’ objet 
de son travail intitulé: Sur des classes trés-étendus de quantités dont Ja valeur n’est ni 
algébrique, ni méme réductible a des irrationelles algébrique, et je rappellerai aussi que 
Villustre géométre a démontré le premier la proposition qui est le sujet de ces recherches 
pour les cas de |’ équation du second degré et de I’ équation bicarrée. 


'Which he commented with his famous words “omnem rationem transcendunt”. 


?This final version appeared in [56]. His proof relies on an application of Rolle’s theorem to the 
minimal polynomial of &. 

Hermite [39, p. 77]; English translation: “In this context, M. Liouville already obtained a 
remarkable theorem which is object of his work entitled: Sur des classes trés-étendus de quantités 
dont la valeur n’est ni algébrique, ni méme réductible a des irrationelles algébrique, and I also 
remind that the illustrious geometer was the first to show the proposition which is subject of this 
research for the case of an equation of second degree and of an equation of biquadratic degree’. 
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Refining and extending Hermite’s method, Ferdinand Lindemann‘ proved that also 
x is transcendental and, henceforth, the impossibility of squaring the circle by using 
only ruler and compass. 

Liouville himself had been inspired by reading the correspondence between 
Christian Goldbach and Daniel Bernoulli published by Paul Heinrich von Fuss in 
1843. As a matter of fact, Goldbach wrote in a letter from August 1729”: 


Here follows a series of fractions like the ones you have asked me for whose sum is neither 
rational nor the root of any rational number: 


1 1 1 1 


ah a t 1.1 
To ' 100, 10000, 100000000 ' nD) 


The general term is 1/ 107°" 


A proof of this statement is missing in their correspondence (and we shall say a few 
words about that in the final section). It is interesting that Liouville later confused 
Daniel Bernoulli with Euler when he reflected his very first attempts®: 


Je crois me souvenir qu’on trouve un théoréme de ce genre, énoncé dans une lettre de 
Goldbach a Euler; mais je ne sache pas que la démonstration en ait jamais été donnée. 


We shall have a closer look on Liouville’s approach. A real number € is said to be 
a Liouville number if for every positive integer m there exist integers p,, and g», > 1 
such that 


1 


Pm 
a 
Im 


dm 


0< 


It is an immediate consequence of Liouville’s theorem and the latter definition that 
every Liouville number is transcendental. The name is chosen well: Liouville gave 
first explicit examples of such numbers by the following regular continued fraction 
expansions 


1 1 1 
at¢ta@atat 


“Who obtained his habilitation and venia legendi in 1877 at the University of Wiirzburg. 
Sef. Liitzen [57, p. 514]. 
SLiouville [56, p. 140]; English translation: “I think I can remember that one finds a theorem of 


this kind announced in a letter of Goldbach to Euler; however, I would not know whether its proof 
was ever given”. 
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with partial quotients a, recursively defined such that 


1 n 
lim sup Soot! =, (1.2) 


n>co 108 qn 


where the g, are the denominators of the convergents to & (see (4.1) below); for 
instance, setting a, = b™ with an integer b > 2 provides such an expansion. 
Another class of examples that Liouville mentioned (without proof), which can still 
be found in many textbooks, is given by extremely fast converging infinite series as, 
for example, 


f= >. Bab with arbitrary f, € {0,1,...,b—1}, 


n>1 


where again b > 2 is an integer. It therefore follows from Georg Cantor’s famous 
diagonal argument that there exist uncountably many Liouville numbers. 

Actually, when Cantor [15] started with his investigations on infinity in 1873, his 
focus was on algebraic numbers. His approach showed that (in modern language) the 
set of algebraic numbers is countable and its complement, the set of transcendental 
numbers, is uncountable; in fact, he wrote’ 


(...) so fand ich den deutlichen Unterschied zwischen einem sogenannten Continuum und 
einem Inbegriffe der Art der Gesammtheit aller reellen algebraischen Zahlen. 


This result came previous to Cantor’s famous and slightly easier statement on the 
uncountability of the set of real numbers. Therefore, the beginning of set theory 
relies on Cantor’s interest in number theoretical questions,*® however, the story 
is more subtle. According to Dauben [20], Cantor’s research of the early 1870s 
was devoted to the theory of trigonometric series and had a strong impact on 
his foundation of set theory; irrational numbers came into play with respect to 
exceptions of the convergence of a certain representation. 

For more details on the history of Liouville’s theorem we refer to Liitzen’s 
monograph [57]; for more details on transcendental numbers, see Burger and Tubbs 
[14] as well as Schneider’s classic [70].? 


7Cantor [15, p. 259]; English translation: “(...) so I found the significant difference between a 
so-called continuum and the epitome of all kind of real algebraic numbers.” 

8]t might be interesting to notice that Cantor’s doctorate entitled De aequationibus secundi gradus 
indeterminatis from 1867 dealed with quadratic forms and was supervised by Ernst Eduard 
Kummer. 

°Which was one of Wolfgang Schwarz’s favourite readings. Theodor Schneider was the thesis 
advisor of Wolfgang Schwarz; he is famous for solving the seventh Hilbert problem on the 
transcendence of a for algebraic 8 and a # 0,1 simultaneously with and independently to 
Aleksandar Gelfond in 1934. 
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2 The Papers by Erdos, Rieger, and Schwarz on Liouville 
Numbers 


In 1962, Paul Erdés [27] proved that every real number is the sum (and product) of 
two Liouville numbers. His paper starts with the following words!°: 


Some years ago I showed (possibly jointly with Mahler), that every real number is the sum 
of two Liouville numbers. A proof of the proposition may now be in the literature, but I 
do not know of any reference. In any case, the following slightly stronger theorem is now 
needed (see [1]), and therefore I publish a proof. 


Actually, [1] is a paper by Melzak on the algebraic closure of plane sets, published as 
[61] the same year and in the same journal as Erdés’ paper. Concerning the unclear 
statement of Erdés about the origin of his paper, there seems to be no reference 
in the literature before the publication of [27]. Erdés had been working with Kurt 
Mahler since the late 1930s.'! 

Here is Erdés’ stronger theorem in its original form: 


Theorem 2.1 (Erd6s [27]) To each real number t (t # 0) there correspond 
Liouville numbers x,y, u,v such that 


t=x+y= wv. 


Erdés’ gave a constructive elementary proof (using the binary expansion of real 
numbers) and another non-constructive one using a category argument which!” 


may be of interest because of its generality. The set L of Liouville numbers, being a dense 
set of type Gs, is residual (in other words, it is the complement of a set of first category). 
Let A and B be any two residual sets of real numbers. For each real number f¢, the set B, of 
numbers t—b (b € B) is also residual, and therefore it contains a point x of A. Let y = t—x. 
Then y € B, and since t = x + y, we have shown that each real number is the sum of a 
number in A and a number in B. We now obtain the first part of the theorem by choosing 
A=B=L. 


Obviously, the categorial reasoning holds for much more general sets. Notice that 


the complements of A and B, are both of first category, i.e., they are countable unions 
of nowhere dense sets, !? 


R\A=(J)m, R\B=(JMn. 


0Erd6s [27, p. 59]. 

'l As indicated by his warm-hearted obituary [28]. 

?Erdés [27, p. 60]. 

13 set M is said to be dense in a topological space X if every point of X is either an element of M 
or a limit point of M; a set is called nowhere dense if its closure has no interior points. 
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Hence, 


ANB, =()\(R\ Mn) O() (R\ Nn) 


is dense by Baire’s theorem and therefore not empty. We shall focus here on the 
additive representation and restate Erdés’ observation in a more general form: 


Theorem 2.2 (Erdés’ Theorem, Categorial Version) Let A,B C R be residual, 
then 


R=A+B. 


Here A + B is the so-called Minkowski sum defined as {a+ b : a€ A,b € B}. 
For the sake of completeness we shall briefly recall basic notions from topology 
and category theory. !* 


Theorem 2.3 (Baire’s Theorem [5]) Any countable intersection of dense open 
subsets (of a complete metric space) is dense. Equivalently, the complement of a 
meagre set is dense. 


A set M C Ris said to be of first category or meagre if it is a countable union of 
nowhere dense sets; otherwise, M is said to be of second category (or comeagre). 
The complement of a set of first category is called residual. By Baire’s theorem, 
every residual set is dense. A subset G is called a Gs-set if it is a countable 
intersection of open sets. Consequently, the countable intersection of dense Gs-sets 
is a dense Gs-set and every residual set is dense. 

In particular, this implies that the set 


e-mong(Uue-#44) 


nN 
n=1 \q>2 p q 


of Liouville numbers is a residual and dense G;-set. It is remarkable that L is 
big in the sense of category and small in the sense of measure (having Hausdorff 
and Lebesgue measure zero). For the set of normal numbers to all bases it is the 
other way around and this is attributed to Emile Borel [10] (concerning the measure 
theoretical statement)!° and Lars Olsen [63] (proving the categorial part). 


'4 very recommendable monograph on this topic is the book by Oxtoby [67] studying both 
subjects simultaneously. 

'S Actually, Borel’s original proof of the statement that almost all real numbers are normal to any 
base (in the sense of Lebesgue measure) was based on a strong law of large numbers and faulty; it 
has been corrected by Cantelli and is now known as the Borel—Cantelli lemma. The first rigorous 
proof of Borel’s statement about normal numbers is due to Georg Faber [30] and Felix Hausdorff 
[38], respectively. We refer to Barone and Novikoff [8] for details about the interesting polemic 
between Borel and Cantor’s pupil Felix Bernstein. 
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It seems that Erd6s’ interest in category theory started with his article [26] from 
1943 in which he refined a result by Wactaw Sierpinski [75] by showing, under 
assumption of the continuum hypothesis, the existence of an involutionf : R- R 
for which f(M) has measure zero if and only if M is of first category. '© 

In 1975, Georg Johann Rieger [68] generalized Erdés’ result on Liouville 
numbers by proving 


Theorem 2.4 (Rieger [68]) Suppose that f\,...,f, are real-valued continuous 
and strictly increasing functions defined on the closed unit interval; then there 
exist uncountably many Liouville numbers & in the open unit interval such that 
fié),....f-(€) are Liouville numbers too. 


His reasoning is elementary, the case r = | is constructive. Taking the functions 
tre f(t) = & -¢, resp. f(t) = &/t, yields Erdés’ Theorem 2.1. Furthermore, Rieger 
investigated the solvability of systems of real linear equations in Liouville numbers 
and proved that for every real t > 1 there exist Liouville numbers a, 6 such that 
t=a?, 

In 1977, Wolfgang Schwarz [71] gave another and in some sense final general- 
ization of both, Erdés’ and Rieger’s result by showing 


Theorem 2.5 (Schwarz [71]) Given (at most) countably many continuous open 
mappings f, : (0,1) ~ R,r = 1,2,..., there exists a dense Gs-subset D of 
(0, 1) consisting only of Liouville numbers & such that all f,(€), r = 1,2,... are 
again Liouville numbers. 


The non-constructive proof is inspired by the works of Erd6s and Rieger and relies 
once again on Baire’s theorem. The paper is very short, just three pages (and this 
is also approximately the number of pages of the preceding papers of Erdés and 
Rieger). Here is Schwarz’s elegant 


Proof The set L of Liouville numbers is a dense G;-set. The same is true for the 
intersection with B, := f,((0, 1)). In view of the continuity the pre-images A, := 
f-'(LN B,) are also Gs-sets, and because of f, being open and surjective, they are 
dense in (0, 1) as well. Thus, by Baire’s theorem, the countable intersection 


D = (LN@,1) Nf )A, 


is again a dense G;-set (and in particular uncountable); it follows from 
flD) C filr) Cif (EO By) C LOB, 


that f,(§) € L for every & € D. This proves the assertion. 


16See Oxtoby’s textbook [67, Chap. 19], for further details. 
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There are a few articles extending and generalizing the results of Erdés, Rieger, 
and Schwarz. We do not intend to give a complete list of those papers but mention 
Kamil Alniagik and Eric Saias [1] who showed that if7 C R is an interval with non- 
empty interior, G a Gs-subset of R, and (f,) a sequence of continuous and nowhere 
locally constant mappings f, : I > R, then (),,f'(G) is a Gs-subset of I. Edward 
Burger [13] gave an analogue for local fields. Senthil Kumar et al. [73] investigated 
questions around the algebraic independence of Liouville numbers and Schanuel’s 
conjecture in the framework of G3-sets. 

Applications of functional analysis to number theory were one of the major 
themes in the research of Wolfgang Schwarz. Besides the example from above 
there should be mentioned the application of Gelfand theory of commutative Banach 
algebras to arithmetical functions; see [46] for an overview. 


3 A Brief History of Baire’s Theorem and Related Topics 


Building on concepts and definitions established by Cantor and Paul du Bois- 
Reymond in the course of foundation of set theory and analysis, René-Louis Baire 
proved in his thesis from 1899 his famous category theorem. In modern language 
this statement can be formulated as follows: 


Theorem 3.1 (Baire’s Theorem, Modern Version) /f X is a complete metric 
space and § a countable union of nowhere dense sets in X, then the complement 
of S is dense in X. 


A space is said to be Baire if each countable intersection of dense open subsets is 
dense. Hence, Baire’s theorem implies that all completely metrizable spaces and all 
compact Hausdorff spaces are Baire. 

It took about 20 years that Baire’s theorem was extended to other complete metric 
spaces. Nowadays Baire’s theorem is interpreted as the origin for three major results 
in functional analysis, namely the uniform boundedness principle (first proved by 
Stanistaw Saks 1929 building on previous work of Henri Lebesgue 1909), the 
open mapping theorem (due to Stefan Banach and Juliusz Schauder 1929), and 
the closed graph theorem. A typical corollary is, for instance, due to Banach [6] 
and Stefan Mazurkiewicz [60] who deduced in 1931 from Baire’s theorem that the 
generic continuous function on an arbitrary compact interval is not monotone on 
any subinterval and nowhere differentiable."" In fact, Baire’s approach often allows 
to prove the existence of a function with certain counter-intuitive properties as, for 
example, the Weierstrass’ monsters of nowhere differentiable continuous functions, 
whereas the explicit construction might be difficult.'® 


"Building on previous work by Hugo Steinhaus [78]. 
'8See Volkert [82] for a historical discussion of Weierstrass’ construction and its generalization. 
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We shall have a closer look on the origin of Baire’s work. As a matter of fact, 
Baire’s theorem is just a more precise formulation of Cantor’s observation on the 
uncountability of the real numbers, and indeed Cantor’s first proof of this statement 
is built on essentially the same ideas which are used to prove Baire’s category 
theorem. Therefore, we shall also take Cantor into account and indicate how his 
work inspired Baire. 

Much has been written about Cantor’s path-breaking foundation of set theory in 
the 1870s. Mathematics and mathematical thinking at his time was rather different 
from what they are today. In particular, the notion of infinity was used in an 
imprecise way and infinite sets were considered countable (to use modern language). 
Carl Friedrich Gau8 wrote in a letter to his friend Heinrich Christian Schumacher’: 


(...) so protestiere ich (...) gegen den Gebrauch einer unendlichen Grdésse als einer 
Vollendeten, welcher in der Mathematik niemals erlaubt ist. Das Unendliche ist nur eine 
facon de parler. 


It is indeed the new concept of infinity which stands at the beginning of Cantor’s 
work and Baire’s theorem. Pierre Dugac wrote about Baire”?: 


D’ailleurs, ce sera BAIRE qui utilisera pour la premiére fois en France, de fagon systéma- 
tique, les travaux de GEORG CANTOR, aussi bien ceux concernant la topologie générale 
que ceux traitant du transfini. A. DENJOY nous disait que ‘ga ne fait pas de doute’, et 
que ‘l’influence de CANTOR sur RENE BAIRE a été considérable’. En particulier, c’est par 
le transfini ‘que Cantor est continuellement présent dans l’oeuvre de Baire’. Par contre, 
‘jamais Baire n’a fait appel a l’hypothése que les nombres réels forment un ensemble bien 
ordonné. Il s’est toujour tenu au dénombrable 


And Oliver Deiser wrote?!: 


Die nirgends dichte Cantormenge hatte Cantor bereits 1883 betrachtet, weiter kann man 
seinen ersten brieflich mit Dedekind diskutierten Beweis der Uberabzihlbarkeit von R 
als einen Beweis des Kategoriensatzes lesen. Aber erst Baire hat die Begriffe Klar 
herausgestellt. 


‘Sef. Ilgauds and Purkert [45, p. 47], resp. Maor [59, p. 55]; English translation: “(...) so I protest 
against using an infinite quantity as accomplished which is nowhere allowed in mathematics. The 
infinite is only a fagon de parler.”’ Here the word ‘fagon de parler’ can be translated literally as 
‘manner of speaking’. 


0Dugac [23, p. 317]; English translation: “By the way, there is BAIRE who was the first in France 
to use GEORG CANTOR’s works in a systematic way in exactly the general topology of infinity. 
A. DENJOY told us that ‘there is no doubt’ and that ‘CANTOR’s influence on RENE BAIRE was 
substantial’. In particular when it is about infinity “Cantor is present in Baire’s work’. On the 
contrary ‘Baire himself never appealed to the hypothesis that the real numbers form a well-ordered 
set. He always believed them to be countable”. 

2!Deiser [21, p. 494]; English translation: “The nowhere dense ({middle third] Cantor set had 
already been considered by Cantor in 1883, moreover, one can read his proof of the uncountability 
of R, first discussed with Dedekind in a letter, as a proof of the category theorem. However, it was 
Baire to emphasize the concept”. 
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A historical comment is in order. It was Henry John Stephen Smith [77] who was the 
first to study the middle third Cantor set?’ in 1874, almost 10 years before Cantor 
[16], Vito Volterra [83] in 1881, and du Bois-Reymond [22] in 1880.73 

In order to indicate the very origin of Baire’s theorem we shall sketch the proof 
of Baire’s theorem (in the shape of Theorem 2.3 for R) in a nutshell: Given dense 
open subsets D,D>,... of R, also (\,<,,<,, Dn is open and dense. Let {J,} be an 
enumeration of rational intervals and let Jp 4 @ be an arbitrary open interval. Define 
Thai = Je = (Lx, re), where k is the least positive integer such that [€;, rg] C InNDn. 
It follows that 


€:= lim & € 17 (\Pn 


n>1 


which shows that the intersection of countably many open sets is dense in R. 
Indeed, Cantor wrote to Richard Dedekind in a letter from December 1873”: 


Es lasst sich nun stets wenigstens eine Zahl, ich will sie 7 nennen, denken, welche im Innern 
eines jeden dieser Intervalle liegt; von dieser Zahl 7 (...) sieht man rasch, da sie in keiner 
unserer Reihen (...) enthalten sein kann. (...) folglich ist die Voraussetzung eine unrichtige 
gewesen. 


The proof with the famous diagonal argument was given by Cantor much later, 
namely in 1890 and appeared in the first issue of the journal of the just founded 
Deutschen Mathematiker Vereinigung” [18]. 

In the beginning, when René-Louis Baire [5] proved a first version of his 
famous theorem, it was a statement about the set of discontinuities of a function of 
several variables which is continuous with respect to each variable separately. Baire 
succeeded to prove that such (not necessarily) continuous functions are pointwise 
discontinuous on each perfect set. In fact, Baire’s theorem explains how infinitely 
real differentiable functions differ from analytic functions. However, already in 
1896, William Osgood [64, 65] obtained the statement for functions of one variable. 
About Baire’s contribution Edward B. van Vleck wrote”°: 


BAIRE in his important thesis has given a number of interesting theorems concerning 
pointwise discontinuous functions. For their demonstration he employs the concept of 
semi- (i.e., upper or lower) continuity. However necessary the concept may be found for 


2Which is constructed by removing each middle third of the unit interval and all remaining 
intervals. 

?3For this and the rather difficult relation between the latter mathematician and Cantor see Laugwitz 
[53]. 

24Cavaillés and Noether [19]; English translation: “Now we can think of a number, I will call it 7, 
which lies in the interior of all these intervals; it follows easily that this number 7 is not contained 
in any of our sequences, henceforth our assumptions was false”. 

?5German Mathematicians Association; Cantor was one of the founders of this society in 1890. 
6van Vleck [79, p. 189]; in the original the names of Osgood and Baire are written in small capitals 
as well. 
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subsequent portions of his investigation, its introduction is not needed for the particular 
theorems referred to. When the unnecessary element is removed, the principles of their 
demonstration, although very different in form, become equivalent to those used by 
OSGOOD in the derivation of his theorem relating to the convergence of series of continuous 
functions. (...) It is found that BAIRE has restricted f(x, y) more than is necessary in the 
following theorem: (...) 


Osgood and van Vleck were at that time two of the many young and talented U.S. 
American mathematicians studying at well-established European universities’ and 
later founding a strong American school of analysis. Osgood [66] was the first to 
prove the Riemann mapping theorem in its full generality (for simply connected 
domains with arbitrary boundaries). And van Vleck [80] was very close to prove 
the Borel attributed theorem that almost all real numbers are normal, mentioned 
above. There he also established a first zero-one law previous to the attempts of 
Borel, Cantelli, and others.** The following stupid words should not be hidden in a 
footnote: during World War I van Vleck wrote about the Lebesgue integral that”? 


This new integral is proving itself a wonderful tool. I might compare it to a modern Krupp 
gun, so easily does it penetrate barriers which before were impregnable. 


Actually, van Vleck was rather influential, serving as an editor of the Transactions of 
the American Mathematical Society from 1905 to 1910, as vice-president in 1909, 
and, finally, president in the years 1913 and 1914. 

We return to Baire’s theorem. Interestingly, there had been another forerunner 
apart from Osgood. As early as 1884 Ludwig Scheeffer used a similar reasoning in 
his discussion of continuous functions. In his article [69] he proved among other 
things the following remarkable Hiilfssatz aus der Mengenlehre*°: 


Es sei P eine beliebige perfekte Menge, welche in keinem Intervall tiberall dicht ist, R 
eine beliebige abzihlbare Menge. Subtrahirt man von allen Elementen der Menge P eine 
Konstante a, so entsteht eine aus den Werten x — a gebildete neue Menge Pz. Es lasst sich 
dann die Konstante a zwischen beliebig gegebenen Grenzen immer so bestimmen, dass die 
Menge P, keinen einzigen Wert von der Menge R enthalt. 


In modern language this means that if R is a countable set of real numbers and 
P C R is nowhere dense, then there exists an everywhere dense set A such that 
R+aMPis empty (whereR+ a:= {r+a:reR}). 

Scheeffer’s result has been extended by Frederick Bagemihl [2] who showed 
that the statement of the Hiilfssatz is even true if P is of first category, i.e., if R 
is countable and P is of first category, then there exists a residual set A such that 


27% the case of Osgood and van Vleck both studied and worked with Felix Klein in Gottingen. 
8See Barone and Novikoff [7] for details. 

°van Vleck [81, p. 7]. 

°Scheeffer [69, p. 291]; English translation: “Let P be an arbitrary perfect set which is not dense 
in any interval, and R an arbitrary countable set. Subtracting a constant a from all elements of P, 
the values x— a form a new set P,. Then one can always determine the constant a in between given 
bounds such that the set P, contains no value from the set R”. The word ‘Hiilfssatz’ is old-fashioned 
German for a proposition. 
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R+ aN P is empty for every a € A. The proof reminds us on Erdés’ reasoning 
in his categorial proof of Theorem 2.1.*! This observation may be used to prove, 
for example, the existence of real numbers r such that translates of the middle third 
Cantor set by r, C + r, are disjoint with the set of rational numbers; notice that C is 
of first category.** 

Scheeffer was a pupil of Cantor who also wrote an obituary [17] on him after 
his untimely death in 1885 caused by typhus; there one can read about Scheeffer’s 
work: 


Eine griindliche Gelehrsamkeit, verbunden mit Reichtum an eigenen Gedanken, welche mit 
musterhafter Einfachheit, Klarheit und Eleganz der Sprache zur Darstellung kommen, bildet 
das wesentliche Merkmal seiner Produktionen. Von den sch6nen Resultaten, zu welchen 
ihn seine mit ei-sernem Fleif gepflogenen Untersuchungen gefiihrt haben, mége hier der in 
Nr. 4 gelieferte Beweis fiir den folgenden Satz hervorgehoben werden: 

“Wenn man von einer stetigen Funktion einer Verdnderlichen wei}, dafi ihr Differentialquo- 
tient Null ist fiir alle Werte eines Intervalls, mit Ausnahme derjenigen, welche irgendeiner 
gegebenen unendlichen Punktmenge erster Mdchtigkeit entsprechen, so folgt hieraus, daB 
die Funktion in dem gedachten Intervall eine Konstante ist.” 


This theorem reminds us strongly of results obtained by the Polish school of 
functional analysts around Banach mentioned in the beginning of this section. 
Such results are indeed typical for applications of Baire’s theorem or, to include 
Scheeffer’s statement, categorial reasoning built on Cantor’s set theory. The notion 
‘erste Mdchtigkeit’ characterizes here a discrete set of real numbers, a concept 
introduced by Axel Harnack [36] to relate point-sets with intervals that has been 
driven further by Borel and others, being the backbone of Lebesgue theory.** 


31t is remarkable that Bagemihl’s paper [2] was published in the same journal as Erdés’ article 
[27] on Liouville numbers about a decade earlier; it might be even more remarkable that in this 
second issue of the Michigan Mathematical Journal there is a paper by Gerald Mac Lane [58] 
on a conjecture by Erdés and others published on the previous pages. However, taking more than 
1500 publications of Erd6és and their deep impact into account this might have no meaning at all. 
Scheeffer’s theorem is also mentioned in the at that time rather influential monograph [76] by 
Sierpinski. 

?The author has learned about Scheeffer’s work and all the other details mentioned above by 
the blog mathoverflow and the valuable comments of Andres Caicedo on the question entitled ‘A 
translation of the Cantor set contained in the irrationals’, posted February 22, 2014, and based on 
Wilman Brito’s book [11]. 

33Cantor [17, p. 199]; engl. translation: “A sound scholarliness connected with plenty of own 
thoughts, which with exemplary simplicity, clearness, and elegant language are constituted, are 
the essential character of his productions. Of his beautiful results, to which he was guided by his 
brassbound diligence driven investigations, I would like to accent here the proof of the following 
theorem from No. 4: 

“If one knows about a continuous function of one variable that its differential quotient is zero 
for all values of an interval with exception of those which constitute an arbitrarily given infinite set 
of first cardinality, then the function is a constant in this interval.” The words in italics are also in 
the original in italics. 
34Scheeffer [69] uses with ‘Inhalt Null’ (meaning ‘content zero’) another notion introduced by 
Cantor in place of ‘erster Machtigkeit’. 
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The paper in question, Nr. 4 in Cantor’s list of Scheeffer’s publications, is [69], 
mentioned above containing the forerunner of Baire’s theorem. 

Actually, Baire was aware of Scheeffer’s work. In a letter to Volterra, dated 
March 8, 1899, Baire wrote about an article by Dmitry Gravé [32] that*: 


Sa note ne me semble pas trés originale, car l’exemple qui’il donne n’est pas autre 
chose qu’un exemple donné en 1885 par Ludwig Scheeffer dans un mémoire des Acta 
Mathematica (Tome V), et auquel, d’ ailleurs, je renvoie dans le dernier chapitre de ma thése. 
L’exposition seule différe, et je me demande s’il peut y avoir 1a une simple coincidence. 
Cette note a été présentée par M. Picard, a son retour de voyage; je mais proposais de lui en 
dire un mot, mais j’ai complétement oublié 4 la derniére visite que je lui ai faite; je lui en 
parlerai une autre fois. 


The thesis mentioned here is, of course, Baire’s thesis, later published as [5], and 
the mathematician behind M. Picard is, of course, Monsieur Emile Picard. 

There is another episode to be told in this context, namely the aspect of 
constructive proofs and a related dispute. It is well known that Cantor’s former 
promoter Leopold Kronecker became a strong opponent to Cantor’s modern set 
theory; his famous words*© 


Die ganzen Zahlen hat der liebe Gott gemacht, alles andere ist Menschenwerk. 


give proof of his clear position in favour of constructive methods and against 
infinite constructions. It was the young generation of mathematicians who cherished 
Cantor’s foundation of modern set theory; for instance, David Hilbert who wrote?’: 


Aus dem Paradies, das Cantor uns geschaffen, soll uns niemand vertreiben kénnen. 


Moreover, at the very first International Congress for Mathematicians, held at Zurich 
in 1897, Adolf Hurwitz [44] gave a talk on recent developments in complex analysis 
and the impact of Cantor’s set theory. 

A rather explicit example of the two contrary positions of Cantor and Kronecker 
seems to give different proofs of the existence of transcendental numbers. Whereas 
Liouville’s proof is constructive Cantor’s reasoning is different, at least on first 
glimpse non-constructive and therefore inacceptable by Kronecker and his fellows. 
It was first observed by Israel Nathan Herstein and Irving Kaplansky [41] in 1974 
that contrary to that what is written in many textbooks Cantor’s diagonal argument 


5Dugac [23, p. 349]; English translation: “His note does not seem to be very original, in particular 
since his example had been given 1885 by Ludwig Scheeffer in his treatise in Acta Mathematica 
(Tome V), and to which I refer to in the last chapter of my thesis. Only the presentation differs and I 
ask myself whether there could have been a simple coincidence. The note had been communicated 
by M. Picard after the return from his journey; I could offer to talk to him, however, I had 
completely forgotten at my last visit; I could talk to him about that another time.” In the author’s 
opinion there are different interpretations of the notion ‘je me demande s’il peut y avoir 1a une 
simple coincidence’. possible here. 


36cf, Weber [84, p. 15]; English translation: “God made natural numbers, all else is the work of 
man”. 

37Hilbert [42, p. 170]; English translation: “No one shall expel us from the paradise that Cantor 
has created”. 
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is in some sense constructive, and it was Richard Gray [33] who computed by a 
detailed analysis the complexity with which ‘Cantor’s methods lead to computer 
programs that generate transcendentals’. Nevertheless, we observe a difference with 
respect to representability in the examples of transcendental numbers provided 
by Liouville’s method versus the reasoning of Cantor and Gray. Note that Erdés’ 
reasoning for Theorem 2.1 based on Baire’s theorem is obviously just an existence 
proof and not constructive. 

In 1904, Ernst Zermelo [85]** introduced the axiom of choice AC in order to 
prove Cantor’s conjecture on the well-ordering of arbitrary sets for which Zermelo 
is sometimes considered as the true founder of abstract set theory.*” Often AC plays 
a substantial role in proving that a space is Baire. However, sometimes a weaker 
version of AC is sufficient, e.g., the principle of dependent choices DC which states 
that, for every pair (X, 0), where X is a non-empty set and p is a relation on X such 
that for all x € X there is some y € X such that x py, there exists a sequence (x,) in X 
with x, 0 X,+1 for each n. One can show that DC is equivalent to compact Hausdorff 
spaces being Baire (as was shown by Charles E. Blair [9]). 

Baire and contemporary French analysts were not against non-constructive 
proofs or Cantor’s set theory, in the discussion about AC, however, they turned out 
to be rather conservative; according to Akihiro Kanamori [48]*°: 


Baire [1899:36] viewed the infinite ordinal numbers and hence his function hierarchy as 
merely une facon de parler, and continued to view infinite concepts only in potentiality. 
Borel [1898] took a pragmatic approach and seemed to accept the countable ordinal 
numbers. Lebesgue was more equi—vocal but still accepting; recalling Cantor’s early attitude 
Lebesgue regarded the ordinal numbers as an indexing system, “symbols” for classes, but 
nontheless he worked out their basic properties, even providing a formulation [1905:149] 
of proof by transfinite induction. All three analysts expressed misgivings about AC and its 
use in Zermelo’s proof. 


Indeed, from a modern point of view their aloofness to abstract set theory is 
surprising and may be even considered ironic with respect to their contributions and 
what grew out of them.*! In particular Baire expressed his disapproving position 
with respect to AC with a drastic racist statement”: 


It is surprising that it is the Germanic races, with a heavy and correct spirit, that throw up 
these rash assertions; it would perhaps be more comprehensible among the hare-brained 
Latins. 


38It might be interesting to notice that his article has the subtitle “Aus einem an Herrn Hilbert 
gerichteten Briefe’, and that he wrote further “Die Idee (....) verdanke ich Herrn Erhard Schmidt”. 
meaning that the article contains material from a letter written to Hilbert and that the idea behind 
is due to Schmidt. 

39Hence, Zermelo rather than Cantor should be regarded as the creator of abstract set theory” 
wrote Kanamori [48, p. 15]. 

40The appearing brackets point to publications of the three authors indicating the year and page. 


4! For a change of thinking credit should be given to, among others, Hausdorff and the Polish school 
of functional analysts. 


“cf. Gray [34, p. 258]. 
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For more details about Baire’s life, his uneasy character and the quarrels with 
Henri Lebesgue and Charles de la Vallée Poussin as well as an overview on the 
various applications of Baire’s theorem we refer to Dugac [23] and Jones [47]. A 
good reading about Cantor’s life and work is the monography of Ilgauds and Purkert 
[45] and we refer to Gispert [31] for details about the reception of Cantor’s set theory 
in France. The history of the axiom of choice and its mathematical impact has been 
treated by Moore [62] and more recently by Herrlich [40]. 


4 Imposing Residue Class Constraints: New Results 


Given positive integers a < b,c < d, we say that an irrational real number & is 
a restricted Liouville number with respect to the pair of residue classes (a mod 
b,c mod 4d) if for infinitely many positive integers m there exist pairs of integers 
(Pm, Im) € (a mod b,c mod d) with positive g,, such that 


P 1. 
E at —; 
Ym Gin 
we denote by L(4 mod fy the set of all such restricted Liouville numbers &. 


Theorem 4.1 For every choice of positive integers a,b,c,d with coprime a and c 


the set L a mod f) is an uncountable subset of L. 


Proof We may assume that b,d > 2. We expand ¢ into a finite regular continued 
fraction 


1 1 1 


— — . ... m=: |do,d),...,4 
a, + a2 + + am ml 


a 
—-=adayt+ 
Cc 


with positive integer partial quotients a,. We shall extend this expansion by contin- 
uing this fraction further. Recall the recursion formulae for continued fractions, 


P-1 = 1, po = 40 and = Pn = AnPn—1 + Pn-2; 
(née N) (4.1) 

q-1 = 0, qo = 1 and dn = 4nGn-1 + qn-2:- 
In view of the coprimality we have py», = a and qn = c. Let a, = 05” with £ := 


Icm[b, d] for all indices n > m satisfying n = m mod 2 and a strictly increasing 
integer-valued arithmetical function 6 : N > N. Then, we obtain by induction 


Pm+2k = Gm+2kPm+2k—-1 + Pm+2(k-1) = a mod Bb, 


Am+2k = Am+2kYm4+-2k-1 + Im+2(k-1) = C mod d 
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for k € N. So far there is no constraint on the partial quotients a, with index n > m 
ifn # m mod 2. Assigning arbitrary positive integer values to the partial quotients 
a, for these indices n, we may define real numbers by the corresponding continued 
fraction expansion 


& := &(a,d) = [LE],a1,...,dm,Antis---, n+l 


We notice that, by construction, every second convergent meets the residue class 
conditions. 

Next we shall specify the arithmetical function 6 in such a way that & is a 
Liouville number. For this purpose recall the classical estimate 


ale 
Gn| — Qn41q 
Hence, & is a Liouville number if for all 
Qe? < any = OOO) forall n>m,n+1=mmod 2. (4.2) 


In fact, we define 6(m + 1) inductively such that this inequality is satisfied as 
Liouville did with (1.2). For the remaining partial quotients d@,,41,@m+43,... We 
allow arbitrary positive integers less than d, and call them the digits of & and 
write € = &(dm41,dm+3,--..). We thus have shown that & is a Liouville number 


amod b 
in L(Shord . 4% 
amo 


It remains to show that L(<“°*—) is uncountable. For this purpose we assume 
that the set of all restricted Liouville numbers 


E(Am+1, Am+35 +++ 5Amt2k+1) +++) with = dn+2K%41 € {0, 1,...,d— 1} 
from the construction above (with fixed a,,...,...,@ and 5) is countable and 
: (n) (n) (n) 
En = E (Gaga ys Gp 30 ee Ame aet ye +e) for n=1,2,... 


is a list of those. Define 


: lif Orn Dt #1, 
E = E(Dint ts Bnt3,-+ +s Omt 2415 +++) with = bmtoxe+1 = 

0 otherwise. 

In view of this choice, we obtain € 4 & for any k € N which shows that already the 
set of all E(Qin4-1, G43, +++ Qm+2k-+1;---) is uncountable. The theorem is proved. 


Our next aim is to prove the analogue of Erdés’ Theorem 2.1 for restricted 
Liouville numbers. However, for this purpose we shall consider infinite series in 
place of continued fractions. 
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Theorem 4.2 Assume that b and c are both coprime with d. Then, for every real 


number t there exist restricted Liouville numbers &, & € L(4 te ») such that 


t=& + &. 


Proof By assumption, there exists a non-negative integer e such that b° = c mod d. 
Let B = b?, where g is Euler’s totient. For a given p we consider the B-adic 
expansion of tb°, i.e., 


the =yo+ Do y,B" — with yo € Z, yn € {0,1,...,B— 1}. 


n>1 


It is easy to see that with € also 4& + z with integer z is a restricted Liouville number 


from L(oees £). Therefore, we may suppose that yo is zero. For j = 1,2, we define 


6= Tarr 


n>1 

by assigning digits po as follows: fork! <n < (k+ 1)!, 

BO =y, B2 =0 if k=1mod 2, 
and 

BY =0, B2 =y, if k=O mod 2: 
moreover, we let 

Lg, BO =y-a if k=1mod 2, 
and 


O =y—a, BY =a if k=O0mod 2. 


Consequently, Sue pe = yp for all n and t = & + &. It remains to show that 


amod b 
§j € Ered d’* 


For j = 1,2, put q? = B+) 'pe and 


G) — 


Pad? Sy were = Layne 


n<(m+1)! n<(m+l1)! 
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Consequently, g = c mod d and p¥) = ch 4p! = amod b. Hence, the residue 
conditions are fulfilled. Furthermore, it follows that 


(i) 
0 Pm _ (A) BoAp-e 
ae ee ee | 
im n>(m+1)! 


In view of the construction the series on the right-hand side has vanishing digits for 
n < (m+ 2)! whenever m = j mod 2 which yields the upper bound 


pe) 1 1 
ar ey a 2 ca < — TAC 
Bee” any" (qn) 


px BB Nye < Bont! p-e — 
n>(m+2)! 


The theorem is proved. 


Our next and final aim is to prove stronger statements by use of Baire’s theorem. 
It is remarkable how easy the categorial method leads to stronger results than the 
just given constructive approach. 

First of all, we derive from the definition of restricted Liouville numbers the 
following representation as 


vemty=@\on| U U (2-—2+5) 


1 
n=1 q22 Li 
= q=c mod d p=a mod b 


Thus, L( mod e 7) is residual and a Gs-set. It is not difficult to show that the image of 


a+ bx 


>NxN ; 
f x LIN aa 


is dense in R>o. Hence, it follows that also L(4 mod 4) is dense in R (without any 


assumption on a, b,c, d). Consequently, by Erd6s” Theoea 2.2, we obtain 


Corollary 4.3 For any choice of positive integers aj < bj, cj < dj (j = 1,2), 


R= L(amed 1 mod Pty 4 jens mod ba) 


cy; mod d; c2 mod dz 


Finally, we shall prove an analogue of Wolfgang Schwarz’s Theorem 2.5 for 
restricted Liouville numbers: 


Theorem 4.4 Given positive integers a < b,c < d, (at most) countably many 
continuous open sie a : (0,1) > R, and sets of restricted Liouville numbers 


Laie oe mod be i ) forr = 1,2,..., there exists a dense Gs-subset D of (0, 1) consisting 
only of restricted Liouville numbers — € L(4 amoe f) such that all f,(€) are restricted 


; é a, mod b, = 
Liouville numbers from L( ae az) for oe eee 
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Proof (along the lines of Schwarz’s reasoning) Every set Lee) of restricted 


Liouville numbers is a dense Gs-set. The same is true for the intersection with B, 
f,((0, 1)). In view of the continuity of the f, the pre-images A, := f-! Oo Creve see be) a nN 
B,) are also Gs-sets; since each f, : (0,1) — B, is open and surjective, the pre- 


images A, are dense in (0, 1) as well. Thus, the countable intersection 


cmod d 


D = (L(244)N0,)) N()A- 


is again a dense G;-set (and in particular uncountable). It follows from 


f(D) C flA,) Cfe(fo | (L( 22S 2 B,)) C L(£ 2%) 2 B,, 


cy mod d, c, mod d, 


that f.(&) € L(4- 2%) for every € € D. The theorem is proved. 


cy mod d, 


5 Concluding Remarks 


The problem of diophantine approximation with residue class constraints for the 
numerators and denominators of the rational approximations was first considered 
by Stanistaw Hartman [37] and Jurjen Koksma [51]. More recently, Carsten 
Elsner [24, 25] investigated Liouville numbers having rational approximations with 
denominators from thin sets. However, his approach and aim is rather different from 
ours. Indeed, we could not find anything comparable to our results in the literature. 

Of course, one could ask whether well-known results on Liouville numbers from 
the literature can be extended to restricted Liouville numbers. For example, Yann 
Bugeaud [12] proved that i) there exists a set of Liouville numbers which are not 
normal in any integer base and whose h-Hausdorff dimension is infinite for any 
sufficiently fast growing dimension function A, and ii) there exist Liouville numbers 
that are normal in all integer bases. 

There are similar results on representations of real numbers as a sum of few sum- 
mands or as a product of few factors. For instance, Marshall Hall [35] showed that 
every real number can be written as a sum of an integer and two regular continued 
fractions each of which having partial quotients less than or equal to four.? 

Bagemihl and Wtadimir Seidel [3] used category theory for further studies in 
diophantine approximation theory; their result can be considered as a topological 
reformulation and extension of Kronecker’s inhomogenous approximation theorem 
[52] for Hausdorff spaces. Also this question was raised by Erdés; moreover, there 
are several joint publications of Erdés with Bagemihl and Seidel on questions 
around topology in the 1950s, e.g., [4]. 


43 Another much simpler example provides the middle third Cantor set C. It is a folklore theorem 
that every real number can be written as a sum of an integer and two elements of C. 
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Furthermore, let us remark that Goldbach’s number § = >» 10-2" defined 
by (1.1) in the introduction is not a Liouville number. Nevertheless, it is a 
transcendental number as was first proved by Aubrey Kempner [49] in 1916; an 
alternative easy proof relies on Roth’s theorem. It is remarkable that the sequence of 
partial quotients of the continued fraction expansion of & is bounded as was shown 
by Jeffrey Shallit [74]. By the way, a related transcendental quantity leads to the 
following trivial and completely useless criterion for Fermat primes: there exist only 
finitely many Fermat primes fy := 27" + 1 if and only if 


pas 10% 


n>=0 
fn is prime 


is rational. It is expected that there are no Fermat primes besides fp = 3,f, = 
5, fo = 17, fg = 257, fy = 65 537. 

We conclude with a quotation from Wolfgang Schwarz’s lecture notes [72] to 
a course on the history of number theory at Frankfurt University; in the preface 
Schwarz describes the concept of the course and the idea of historical studies**: 


Die Vorlesung ist damit ein Mittelding zwischen einer Geschichte der Zahlentheorie und 
einem zahlentheoretischen Ergebnisbericht. Ein Ziel geschichtlicher Untersuchungen, 
Orientierung zu vermitteln, kGnnte damit erreichbar sein. 


In the course of writing this article the author has learned a lot from both, the 
mathematical and the historical point of view on Liouville numbers. It is not only 
the result that matters, but also the way we achieve it. 
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Abstract For an algebraic number field K # Q we prove that the unit disc 
is a natural boundary of the power series > axe zg running through the 
integral ideals of K and N denoting the norm function. As an application, we 
deduce the same result for power series )° shy g(a) 2’ with specific multiplicative 
coefficients g(a) thereby extending known results to algebraic number fields. 
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1 Introduction 


Functions f: N > C satisfying f(1) = 1 and f(mn) = f(m)f(n) for (m,n) = | are 
called multiplicative. Under certain general conditions on f the generating power 
series 


PY.D=) fz (1.1) 


n=1 


converges and represents an analytic function on the open unit disk U := {z € 
C: |z| < 1}. The classical problem of analytic continuation of P(f, z) beyond dU 
was studied in [5, 6]. In particular, for a large class .% of multiplicative functions 
f, Lucht [5, Theorem 4] provides an equivalent arithmetical characterization of the 
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singular behavior of P(f, z) on OU. This can be applied to many important arithmetic 
functions, especially to functions f € W characterized by Wirsing [7, Satz 1] (for 
the definition of Y see [5, p. 368]). 

The aim of this paper is to transfer the above-mentioned results to complex 
valued multiplicative functions defined on the set /)(K) of non-zero integral ideals 
of an algebraic number field K. Functions g:/o(K) — C satisfying g(e¢) = 1 for 
e = (1) and g(ab) = g(a) g(6) for (a, 6) = ¢ are called multiplicative. The identity 
function n +» I(n) = n forn € N then corresponds to the norm function a b> Nx (a) 
counting the residue classes modulo a. In particular, for K = Q we have Ng(n) = n 
if n = (n). For any algebraic number field K, Nx is completely multiplicative, i.e. 
Nx(ab) = Nx(a) Nx(6) for all a, 6 € Jp(K), and the power series (1.1) takes the 
form 


P{K,g.d0= D> sa — (W(a) = Nx(a)). (1.2) 


a€lo(K) 


The usual multiplicative functions on N are related to those on Jp(K), as (1.2) may 
be rewritten as 


P(K,g,z) = G(n) 2" 
1 


n= 


with 


Gin) = Yo g(a). (1.3) 


N(a)=n 


and G is multiplicative if g is multiplicative (compare [3, § 41]). Provided that G € 
Ws, [5] entails an immediate characterization of the singular behavior of P(K, g, z) 
for z € OU. 

Observe that the constant function /:J)(K) — {1} with /(a) = 1 for alla € 
Io(K) corresponds to the function F with 


Fn)= D> 1, (1.4) 


N(a)=n 


the number of integral ideals of K with norm n. In Sect. 2 we shall see that F does not 
necessarily belong to W (Remark 2.7). Hence the results of [5] cannot be applied 
to F, in general. 

In Sect. 2, we therefore introduce certain classes .% for e € N such that #7 C 
|). and corresponding classes % refining W% accordingly. As the methods and 
results of [5] then can be easily transferred from % and % to % and %, we obtain 
equivalent characterizations of the singular behavior of P(K, g, z) for z € 0U in case 
of Ge “4 andGeé % (Theorems 2.3 and 2.6). 
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For the application of Theorems 2.3 and 2.6 to specific functions g: Jo(K) — C it 
remains to verify that G « Y% for some e € N. Actually this is not even evident for 
the simple constant function / corespondite to F via (1.4). For the rational number 
field K = Q we have P(Q, /,z) = ;& for z € U, which obviously has an analytic 
continuation beyond dU. But this dons no longer hold for K # Q. In Sect.3 we 
prove the main result of this paper: If K 4 Q, then F € &% for some e € N, and 


> 2N@) 


acélo(K) 


has no analytic continuation beyond dU (Theorem 3.1). The proof uses results of 
algebraic number theory and can also be used to characterize those F with F € % 
(Remark 3.2). 

Finally, in Sect. 4 we discuss further specific arithmetic functions g: [9(K) > C 
to find that known negative results concerning the analytic continuation of P(Q, f, z) 
beyond dU remain valid when considered in algebraic number fields K 4 Q. The 
examples permanently use Theorem 3.1. 


2 The Classes .% and Y, 


We begin with defining the classes .%,* and .% of multiplicative functions f: N > C 
for e € N which represent a slight modification of the classes “%* and .% in [5]. 
Let M(f,x) denote the sum 


M(f.x) = >> f@). 


nsx 


Definition 2.1 For e € N let * consist of all multiplicative functions f:N > C 
with the following properties: 


(a) There exists some s € C witho = Res > 0 and a slowly oscillating function L 
such that for each q € N, (q,e) = 1, there is a constant cg € C with 


CqX' L(x) + 0 (x7 |L(®)|) if x = Xo mod g 


M ; = 
WN 56s i) pl aamidlg 


(x > 00) 


for all characters y mod q. 
(b) There is some q* € N with (q*,e) = 1 and cqx # 0. 
(c) For each prime p there is some € > 0 such that the series 


FAG 20) 
= po- e)v 


converges. 


512 F. Tuttas 


Here L: [0, 00) — C is called slowly oscillating if L is measurable and L(x) ~ 
L(cx) for x > oo, uniformly for 1 < c < 2. 

In contrast to .#*, the definition of .%,* requires coprimality of g, e and of q*,e 
in (a) and (b). Clearly %* = “*. Bothf € “* and c, # 0 for some a € N entail 
that f € .%,* for all e € N with (e,a) = 1. Hence #* C L) .%* forallaeN. 


e>a 
To formulate Lemma 3, Corollary 2 and Theorem 2 in [5] for f € .%* only 
require a redactional revision which does not change the assertions and may be left 
to the reader. We note that the number s from Definition 2.1 is uniquely determined: 
S= Sp. 
Definition 2.2 For e € N let .% be the class of functions f € ,* satisfying the 
additional property: 


(d) lim x L(x) does not exist. 
xX 0O 


Similarly we see that “4 = “ and # C |) % forallaeN. 
e=a 
For the function classes under consideration the power series P(f,z) converges 
for all z € U, and the characterization of functions f € .% having OU as natural 


boundary according to [5, Theorem 4] transfers to functions f € .%: 
Theorem 2.3 Let f € %, s = sy. Then the following assertions are equivalent: 


(A) P(f,z) has U as natural boundary. 
(B) There are infinitely many prime powers p* with p + e such that 


1 fp) 
np(f.p.8) # —~ op) pats’ 


Co ee 
where np(f,p.5) = “EP for BEN. 
v=, 


Next we recall the class of multiplicative functions investigated by Wirsing [7]. 


Definition 2.4 Let Y denote the set of multiplicative functions f:N — C with the 
following properties: 


(a) fx) =O forneN. 


(b) y pera <oo forsome é¢> 0. 
p.v>2 


(c) lim 4} f(p)logp £0. exists. 
xoo * Hz 


For e € N let further % be the class of functions f € W such that for each q € 
N with (q,e) = 1 and each character y mod q, x # Yo, the following holds in 
addition: 


(d) lim r US (P) x(p) logp exists, 


(e) > suol=tes) diverges. 


Pp 
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Note that condition (b) in Definition 2.4 is a slight generalization of local 
restrictions imposed on the values of f at the prime powers p” with v > 2 in 
Wirsing’s paper [7] such that, in particular, Satz 1 and Satz 2 in [7] remain true. 
This yields that % C “%. We summarize: 


Corollary 2.5 Y% C % foralle €N, sf = 1 for anyf € WH, andW =. 


As an immediate consequence we obtain the following result for multiplicative 
functions g:1)(K) > C. 


Theorem 2.6 Let K be an algebraic number field, g:In(K) — C a multiplicative 
function and G defined by (1.3). If there is some e € N such that G € %, then the 
following assertions are equivalent: 


(A) P(K, g,z) has U as natural boundary. 
(B) There are infinitely many prime powers p® (B € N) with p + e and 


1 G(pf") 
Ga) 4—=—— 
np(G, p 1? oe Pal 


In the introduction we noted that, in general, F € % is not true which is justified 
here: 


Remark 2.7 There are algebraic number fields K 4 Q such that the function 
F:N — Z associated with the constant function /:19(K) — {1} according to (1.4) 
does not belong to #. 


For, if K is an algebraic number field containing the mth cyclotomic field Q(e* ) 
then, according to [1, Lemma 3, p.351], N(p) = 1 mod m holds for every prime 
ideal p which does not divide m. For m > 3 there exists a non-principal character 
x mod m. Hence 


y F(p) — Re x(p)) _ y PaREAP) = yy. 


psx . p=N(p)<x P 
ptm ptm 


which contradicts Definition 2.4 (e). 


3 The Power Series )* 7 
aélg(K) 
In the sequel we study the special case g = /. 


Theorem 3.1 For every algebraic number field K # Q the power series 
PRia= y, 2 


acl (K) 


converges in U and has 0U as natural boundary. 
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Proof We have to verify that the function F associated with J according to (1.4) 
satisfies 


) Fe F&, 
(I) F € % for some e € N, 
(II) F satisfies condition (B) from Theorems 2.3 and 2.6. 


Proof of (1): Condition (a) of Definition 2.4 is satisfied as is (c) by Landau’s 
prime ideal theorem, for 


XxX 

5 F(p) ~ ; (x > oo), 
ogx 

psx 


and partial summation implies (c). 
To verify (b) we estimate F(p’). Let [K : Q] = and let a an integral ideal of K 
with N(a) = p”. Then the prime ideal divisors p;,..., p, of a satisfy 


a=pii---p?, a EN, M(po) =p’ withy, €N, r<n. 
Taking the norm on both sides yields p” = p"!™! --- p’’*" or, equivalently, 
Vid) Fees + Vy = V (A, ¥9 € N, r <n). 


For the number L(v) of solutions of this equation we obtain with the divisor 
function T, 


L(v) = 5 i< J) T(my)-+-T(m,) 
V1,.-.V->1 m|,....mp>1 
via t+, =v my ++my,=v 
E r<n 
n n 
< vy y l< y vev <(v +1)". 
r=1 m4,....M->1 r=1 
my t+m,-=v 


This entails F(p”) < L(v) < (v + 1)", and (b) is satisfied for 0 < ¢ < ; : 


Proof of (ID: For gq € N let L = K(€) with ¢ = et the cyclotomic field over 
K. Then L is a normal extension of K with an abelian Galois group, isomorphic 
to some subgroup of the prime residue group modulo q. The Kronecker—Weber 
theorem implies the existence of some m € N such that [L : K] = ¢(q) for all q, 
(q,m) = 1; see Hasse [2, Satz 133, p. 196]. For those q the Galois group of L/K 
is isomorphic to the prime residue class group modg and, in particular, abelian. 
To verify properties (d) and (e) of Definition 2.4 we use Chebotarev’s density 
theorem. We recall the following: 

Let p be a prime ideal of K and ‘8 a prime ideal divisor of p in L. Then for 
unramified p the Frobenius automorphism o uniquely determined by p and ‘8 
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satisfies 


EN) = o(€) mod P 


for all integral € € L. This holds for any normal field extension L of K. If t € 
G = Gal(L/K), then the conjugate prime ideal 1 (98) of $8 corresponds to the 
Frobenius automorphism tot~!. The conjugacy class of o in G is independent 
of the prime divisors of p in L and is denoted by the Artin symbol (4) (= 


icot see G}). For a conjugacy class C of G let further 


mc (x, L/K) — ba 1 ; 
p unramified in L 
N(p)Sx 


(4#)=c 


Chebotarev’s density theorem says: Let K be an algebraic number field and La 
finite normal field extension of K with Galois group G. Then 


Icl 
|G| logx 


c(x,L/K) ~ (x > oo). 


This asymptotic equation suffices for our purposes, for a version with remainder 
term see Lagarias and Odlyzko [4]. 

We apply this theorem to L = K(¢) with ¢ = et and (q,m) = 1 where m was 
chosen such that |G] = (gq) for (q,m) = 1. Letting e = m we have to verify 
conditions (d) and (e) of Definition 2.4. 

As G is abelian the conjugacy classes of G are singletons. Denote the automor- 
phism € + ¢% by og. Then G = {og : (a,q) = 1}. Let p neither divide the 
relative discriminant of L/K nor the discriminant d of the polynomial x4 — 1, and 
let (#4) = {o,} for some a, (a,qg) = 1. Then, for all integral € € L and all 
prime divisors 5% of p, 


EN®) = o,(&) mod P, 


hence 


ENP) = o,(&) mod p, 


and thus 


cN®) = 4 mod p. 
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Suppose that ¢%) 4 C4, then £%®) — ¢4 divides the discriminant d so that p | d,a 
contradiction. Therefore ¢%?) = €4, so N(p) = a mod q. Now by Chebotarev’s 
theorem 


1 x 
> 1~ — (x > 00). 
pee p(q) logx 


N(p)=a mod q 


Since we can neglect prime ideals of higher degree this asymptotic relation yields 


1 x 
dX ea (x > 00). 


p=a mod q 


Partial summation gives 


> Haines (x > 00) (3.1) 
~ o() 
p=a mod q 
and 
F 1 
CY pe Deiat senet acl (3.2) 
ae ~(q) 
p=a mod q 
which implies (d), 


Flr) xp) logp = J xa) YO F(p)logp=00) > &) 
psx a mod q psx 
p=a mod q 


for each character y 4 Yo mod q, (q,m) = 1. Here * indicates summation over 
the prime residue classes mod q. 
In the same way we obtain (e), namely, for y # Xo mod gq, (g,m) = 1, by (3.2) 


1—-R * 
yp PMG Rex) _ Reg) DO FO 


P 


psx P amod q pxx 
p=a mod g 


= = ) loglogx: yy ( 1— Re x(a)) (x—> 00). 


amod q 


(3.3) 
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For g > 3 and y # Yo mod q there is some a € N with (a, q) = 1 and x(a) £ 1. 
Hence | — Re y(a) > 0, the sum factor in (3.3) does not vanish, and the series 
(e) diverges. 

Proof of (TI): We have sr = 1. As in the proof of (II), let m € N be such that 
F € Y,,. Suppose that condition (B) of Theorem 2.6 is not valid for F. Then 
there are at most finitely many primes p with F(p) = 0, because otherwise (B) 
would hold for these p with 6 = 2. 

For all sufficiently large primes p > m exceeding the discriminant of K we have 
F(p) = 1, pis not divisible by any prime ideal square, and 


m(F.p.1) = —— 
p(P) 
for B = 1. It follows that F(p”) > 1 for v > 1, and hence 
1 1 
g(p) Cp)" 


Hence F(p”) = 1 for v > 1, and the ideals (p) are prime ideals of first degree 
of K. Taking the norm yields p” = N((p)) = p, son = [K : Q] = 1, that is 
K = Q, which is a contradiction. Therefore F has to satisfy (B), which completes 
the proof of Theorem 3.1. 


1 1 
PP 


The proof of (III) could be simplified a little by observing the known fact that, 
for each K 4 Q, F(:p) = 0 holds for infinitely many primes p. Namely, let L be the 
normal closure of K/Q, let n = [K : Q] > 2, N = [L: QJ, and let the functions 
attached to K and L, resp., according to (1.4) be denoted by Fx and F_, resp. Then 
a prime p is totally split in L iff it is totally split in K, and we have for x > oo 


1 x 
X15 Ll wings 


psx psx 
Fx(p)=n FL (py=N 
x n—1 x 
Dr =Usouy A= Yo 14+( sow) & 
‘ logx N logx 
pSx psx 
1<Fx(p)Sn 
n—1 x 
l= 1—- 1>{— 1 : 
Hisd Ye ( roo) Se 
DSx psx psx 
Fx(p)=0 1<FK(p)<n 


We mention that, for the same reason, a previous non-continuability criterion [6, 
Theorem 5] cannot be applied to F, as the requirement 


Y= IF@)P = 0() 


nsx 


is not satisfied for K 4 Q. 
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As regards the question when F € %, a partial answer was given in Remark 2.7. 
Using the argument in the proof of (ID), we can now characterize the functions F 
belonging to %. 


Remark 3.2 For K ~ Q we have F € % iff Q is the only (absolute) abelian 
subfield of K. 


For, if K does not contain any abelian subfield Z#QKNQnz = Q for all 


cyclotomic fields Q, = Q(é), ¢ = en. According to Galois theory, we have for 
L=K(é) 


G = Gal(L/K) © Gal(Qn/K N Qn) = Gal(Qn/Q), 


and thus |G| = g(m). Using the same argument as in the proof of (II), we obtain 


1 

Y> F(p)~ —~—— (x > ©, (a,m) = 1), 
= y(m) logx 

p=a mod m 


and hence F € Y%. On the other side, if Ky = K 1 Qn 4 Q for some m > 3, we 
have 


G = Gal(Q,,/Ko) = H, 


where His a proper subgroup of the prime residue group modulo m. We can find 
a character y modm, y # yo , such that y(a) = 1 for all a € H. According 
to the proof of (II), we have for each prime ideal p of K (except at most finitely 
many) N(p) = a mod m for some a € H. Therefore, for sufficiently large p > po, 
F(p) = Oif p = bmodm, b € H. It follows that F(p)(1 — Re x(P)) = 0 for 


F(p)(1-Re x(p)) 
P 


P = Po, the series Le is convergent and F ¢ %. 


4 Examples 


Let K be an algebraic number field, not necessarily distinct from Q, and let n = 
[K : Q| be the degree of K over Q. According to the proof of Theorem 3.1, fix 
m € N such that F € Y,,. In the sequel we consider specific multiplicative functions 
g:Io(K) — C so as to show that dU is the natural boundary of the corresponding 
power series P(K, g, z). Again, with G: N > C given by (1.3), 


Gin) = Yo g(a), 


N(a)=n 


G € Y,, will follow from F € Y,,. Then Theorem 2.6 applies and it remains to 
verify that condition (B) of this theorem holds. 


Natural Boundaries of Power Series with Multiplicative Coefficients 519 


All examples to be considered here show certain similarities which we outline 
first to clarify and shorten the argumentation. 

First we will always have G(n) > O and G(p) = F(:p) c(p) with c(p) = C+o(1) 
with some constant C > 0 for all primes p € P. It follows from Theorem 3.1 that 
conditions (a) and (c) of Definition 2.4 are satisfied. For the proof of convergence of 
the series in Definition 2.4 (b) for some ¢ € (0, 3) we set 
H(n) = max {g(a) : N(a) = n}. 


As g will only take non-negative real values in our examples, we see that H(n) > 0 
and 0 < G(n) < F(n) A(n) for n € N. Therefore the convergence of the series in 
Definition 2.4 (b) is guaranteed by any inequality of the form 


A(p") < vv + 1% (4.1) 


with some constants a, y > 0 which might depend on n. Combined with the already 
proved inequality F(p”) < (v + 1)?” this would imply G(p’) < y(v + 1)7"T*. 
Conditions (d) and (e) of Definition 2.4 follow immediately from the above 
assumptions concerning G(p) combined with (3.1) and (3.2). Altogether (4.1) yields 
Ge W,, and sg = 1. 
Suppose that condition (B) of Theorem 2.6 does not hold. Then for all sufficiently 
large primes p > po > mandallaeéN, 


G a—1 
eps 22. 
P(p) Pp 


Taking differences gives 


G(p4 1 1 G(p* 
Na(G, p, 1) — na+i(G,p, 1) = te Pe — (civ - 
P g(P) P 


which entails that 


G(p") = G(p“') 


for alla € N and p > po. Hence, for a = 1, G(p) = F(p)c(p) = 1. This is a 
contradiction since, according to the remark following the proof of Theorem 3.1, 
F(p) = 0 for infinitely many primes p. 

As an intermediate result we note 


Corollary 4.1 Let g:Ip(K) — R be a multiplicative function with non-negative 
values and G:N — R the multiplicative function associated with g according 
to (1.3). Suppose that G(p) = F(p) c(p) where c( p) = C+0 (1) with some positive 
real constant C. If there are positive constants a, y which may depend onn such that 
H(p’) < y(v + 1)° for all prime powers p” with v € N, then G € WY, and U is a 
natural boundary of P(K, g, z). 


The first example deals with g(a), the number of prime residue classes mod a. 
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Example 4.2. S~ (a) z\™ has dU as natural boundary. 
ao 


Proof It suffices to verify the assertion for g(a) = 


(a) _ 1 
N@) = IT( ~ wo) 
We have G(p) = F(p)(1—+), hence c(p) = 1—1, H(n) < 1, and Corollary 4.1 


: : . P Pp? 
implies the assertion. 


For integral ideals a of K and k € N we may define the sum of divisors function 
by ox(a) = D7 N*(Q). 


dla 
Example 4.3 > o;(a) 2 has dU as natural boundary. 
a#o 


Proof \t suffices to verify the assertion for g,(a) = =. 


We have G;(p) = F(p)(1+ xx). hence c(p) = 1+ x . To show the convergence 
of the series in Definition 2.4 (b) we proceed as in the proof of Theorem 3.1, part 
(I), and set 


=) we with a € N, N(Po) = p”? with Vo EN, r<n (4.2) 


for integral ideals a of K with N(a) = p” and distinct prime ideals py, @ = 1,...,7. 
Then 


neve OP) oK(PHT) (14 a 


~ NET) NE (pe) yet 7 a) 2(ay 


puik 


and therefore H(p”) < (v + 1)”. Corollary 4.1 establishes Example 4.3. 


For integral ideals a of K we define the divisor function by t(a) = )> 1. 
dla 


Example 4.4. > t(a)z™ has dU as natural boundary. 
a#o 


Proof We have G(p) = 2F(p) and c(p) = 2. For the proof of convergence of the 
series in Definition 2.4 (b) we use (4.2) for integral ideals a with N(a) = p” again 
to obtain 


vid} +---+ V/A, = V, 
(a) +1)+---+(@-+1)<v4r. 
The arithmetic-geometric mean inequality yields 
v+ "y' 
a : 


r(a) = (@ + 1)+-@, + 1) <( 


Therefore H(p”) < (v + 1)", and Corollary 4.1 implies the assertion. 
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With the Mébius function jz defined as usual for integral ideals of K, 4*(a) 
represents the characteristic function of the squarefree integral ideals a of K. 


Example 4.5. > ?(a) zY has dU as natural boundary. 
a#o 


Proof Corollary 4.1 can be applied since we have G(p) = F(p), c(p) = 1, 
A(n) < 1. 
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Abstract A simple proof of Hardy’s theorem on the existence of power series 
which converge uniformly on the unit circle but not absolutely. 
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1 Introduction 


There are power series which on the unit circle converge uniformly but not 
absolutely. 

This was proved in 1913 by Hardy [3], answering a question of M. Riesz. 
Hardy gave three examples, all closely related to the series )*, n~!“‘(logn)~!z". 
For the proof he uses a Tauberian argument showing that the partial sums of the 
series )*>,n~!~‘z" are uniformly bounded on the unit circle. Hardy remarks that 
J.E. Littlewood had an elementary proof for the later statement, but this, apparently, 
was never published. E. Landau in his famous book Einige neuere Ergebnisse 
der Funktionentheorie presents a simplified version of Hardy’s proof, cf. Landau- 
Gaier [5]. 
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Further proofs have been published since, but all, as far as I know, employ some 
more or less advanced tools. Thus Gaier [2] elegantly uses the conformal mapping of 
the unit disk to a region with a rectifiably unreachable boundary point. Herzog and 
Piranian [4] use lacunary series, Erdés et al. [1] additional combinatorial arguments. 
A certain similarity to the proof below may be found in the one of Lésch [6] who 
applies Abel’s theorem in its generalization to angular spaces to series of the type 
ar CU" po (z) with E(z) = 1 — V1 —zand suitably growing k,. 

At a conference in Colfosco in 2004 W. Luh lectured on a proof of his that is 
elementary and quite easy. It was this one that inspired me to the one presented in 
this note. We start from first principles and do not require any calculation. 


2 The Proof 


For polynomials P over C let 


M(P) := the maximum of |P(z)| on the unit circle, 
A(P) : 


the sum of the absolute values of the coefficients. 


Both of these functionals stay unchanged if P(z) is replaced by z”P(z) or P(z”). 
Moreover Q(P) := M(P)/A(P) for P 4 0 does not change if P is multiplied with a 
nonzero constant. Obviously Q(P) < 1. 


Lemma 2.1 For any ¢ > 0 there are polynomials P such that Q(P) < «. 


To derive Hardy’s result from the lemma pick polynomials P; such that Q(P;) < 
1/k, normalize them so that A(P;,) = 1/k and, consequently, M(P;) < 1/k?, and 
line them up nonoverlapping into a power series )°,, a,z” by making 


» ang’ = Zt PYG) 


Nk<NSNg+1 


with suitable ny. This series does not converge absolutely on |z| = 1 since 


Yilal = FAey = PE = ow. 
n k k 


On the other hand, the particular partial sums s,, do converge uniformly because 


1 


Sri — Sng = |Pe(2)| < M(Px) < Rp’ 
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and an arbitrary s, is linked to the special s,, with ng <n < ng+1 by 
1 
[Sn —Sn,| < A(Px) < Z > 0. 


Thus our power series does converge uniformly on |z| = 1. Oo 

To prove the lemma it suffices to produce one polynomial P; with Q(P\) < 
1, since for any two nonzero polynomials R and S and an exponent g > degR 
we have A(R(z)S(z8)) = A(R) - A(S), immediately implying sub-multiplicativity 
Q(R(z)S(z8)) < Q(R) - Q(S). Thus for any g > deg P| the recursion 


Pei) = Pi@Pele) for. kS1 


generates a sequence of polynomials with Q(P;,) < Q(P\)* > 0. 
Actually Q(P) = Q(co +++: + caz“) = 1 is but an exception if d > 2. If we 


allow nonzero c, only, then at the place a of the maximum (|a| = 1) the equation 
| >>, Cna"| = ¥,, |cn| cannot hold unless all a’c,/co are positive, and in particular, 
hence, coc2/ G > 0. oO 
3 Remark 


Polynomials with small quotient Q are known. A nice example are the Fekete 
polynomials, that are built with the Legendre symbol (here for primes p): 


p-l 
n 
Fp = Yo (=), 
n=1 P 
for which one can show quite elegantly that 
log p 


QF) <« 


Be 


What one needs is essentially that the Gau8 sums have modulus ,/p. But that, of 
course, is an “advanced tool’. 
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Abstract A quasiplatonic curve can be defined over the rationals if it has a regular 
dessin with abelian automorphism group. 
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1 Basic Facts and Definitions 


The following facts and definitions are fundamental for the theory of dessins 
d’enfants on Riemann surfaces. Proofs and more details may be found in [11] and 
the references given there. 


1. Compact Riemann surfaces Y are smooth projective algebraic curves, holomor- 
phic mappings between such Riemann surfaces are morphisms between the 
algebraic curves. 

2. As a curve, Y can be defined over a number field if and only if there is a Belyr 
function B : Y > C = P!(C) on Y, that is a meromorphic function, non- 
constant and ramified at most above 0, 1,00 [1, 7, 9, 12]. In this case we call Y 
a Belyi surface or a Belyi curve, (Y, B) a Belyi pair. 

3. The 6-preimage of the real interval [0, 1] defines a bipartite graph on Y with 
B-'(0) and B'(1) as the sets of white and black vertices, respectively. The 


Both authors were supported by the DFG project Wo 199/4-1. 


J. Wolfart (1) * B. Miihlbauer 
Institut fiir Mathematik der Goethe Universitat, 60629 Frankfurt a.M., Germany 
e-mail: wolfart @ math. uni-frankfurt.de 


© Springer International Publishing Switzerland 2016 527 
J. Sander et al. (eds.), From Arithmetic to Zeta-Functions, 
DOI 10.1007/978-3-319-28203-9_32 


528 J. Wolfart and B. Miihlbauer 


graph subdivides Y into simply connected cells, and we can consider the poles 
of 6 as their midpoints. This embedded graph J is called the dessin d’enfant [8] 
corresponding to 6. 

4. On the other hand, if we embed a bipartite graph Z in a compact oriented 2- 
manifold Y such that Y \ Y is the disjoint union of simply connected cells, 
then there is a unique conformal structure on Y and a unique Belyi function 6 
corresponding to F ([8, 14] or [11, Theorem. 3.15]). 

5. For every Belyi pair (Y, 6) , that is for every dessin, there are cocompact Fuchsian 
triangle groups A and finite index subgroups ® < A suchthat Y ~ @\H where 
HH denotes the hyperbolic plane and such that the Belyi function can be written 
({17, Theorem. 3], [11, Theorem. 3.10]) as the canonical mapping 


®\H > A\H =~ C: 2b Az 


The entries in the signature (/,m,n) of A are common multiples of the zero 
orders of 6, 1 — 6 and 1/f, respectively. If 1,m,n are chosen minimally, we 
call (/,m,n) the type of the dessin. For some triples of small numbers A will be 
an Euclidean or a spherical triangle group and H has to be replaced with C or the 
Riemann sphere on 

6. The dessin is called regular if there is a group G < AutY acting transitively 
on the set of edges of Y and preserving the colours of the vertices. If Y has a 
regular dessin, it is called quasiplatonic (or curve with many automorphisms in 
older papers, triangle curve in the literature about Beauville surfaces). There are 
many other characteristic properties ({17] or [11, Theorem. 5.1]), for example 


— the Belyi function f defines a normal covering, 

— its monodromy group (the so-called cartographic group of the dessin) is 
isomorphic to its covering group, 

— @isanormal subgroup of A, andone has A/@ YG. 


Every Belyi surface (or every dessin) has a quasiplatonic (respectively regular) 
covering of finite degree. 

7. The field L is called a field of definition for Y if all coefficients of the polynomials 
defining Y are contained in L.. More generally, we call L a field of definition for Y 
if there is a curve Y’ isomorphic (over C) to Y defined over L. For o € Gal Q/ Q 
we denote by Y° the curve obtained by the action of o on all coefficients of the 
defining equations of Y . In the context of the present paper, we may suppose that 
L is a number field; it is not at all uniquely determined, however the moduli field 
M(Y) of Y, that is the fixed field of the subgroup 


Vy := {0 € GalQ/Q| Y’ = Y} < GalQ/Q 
of the absolute Galois group depends only on the isomorphism class of Y. It is 


contained in all fields of definition of Y. For any number field K we can define 
similarly a field of definition > K and a moduli field Mx(Y,G) for the pair 
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consisting of Y and a group G < AutY of automorphisms as the fixed field of 
the group 


Vv.ay/K = {0 € GalQ/K | 3f, : Y > Y’ isomorphism with 
fo°a@ =a? of,Va € G}. (1.1) 


8. It is well known that the moduli field is not always a field of definition of Y 
(5, 13]. According to Weil [16] it is a field of definition if there is a choice of 
isomorphisms f, : Y — Y° for all o € GalQ/M(Y) satisfying the cocycle 
condition 


for =fi of, forall o,t € GalQ/M(Y) 


9. By consequence, Y can be defined over M(Y) if the identity is the only 
automorphism of Y because then all f, are uniquely determined. By variations 
of this argument, also regular dessins, that is Belyi pairs (Y, 6) with normal 6 
can be defined over their moduli field M(Y, 8) [3, Proposition 2.5], obviously 
defined as the fixed field of 


Viv.p) = {0 €GalQ/Q | Af, : Y > Y° isomorphism with B° of, = B}. 


From this result one can derive that also quasiplatonic curves can be defined 
over their moduli field ([4], [17, Theorem 5], [11, Theorem 5.4]). Similarly, 
(Y, Aut Y) can be defined over its moduli field if Aut Y contains its centre as 
a direct factor [6, Theorem 1]. 


2 Abelian Automorphism Groups 


Theorem 2.1 Let Y be a regular dessin on a quasiplatonic curve X with abelian 
automorphism group Aut Y, and let B denote the corresponding Belyi function. 
Then, (X, B) is defined over Q. 


Corollary Quasiplatonic curves having a regular dessin with abelian automor- 
phism group can be defined over Q. 


Theorem 2.1 is already given as Proposition 3 in [2], together with a sketch of a 
proof. For the corollary, and under slightly different hypotheses, there are two more 
recent proofs in [10] and [15, Corollary 1.2], based on quite different ideas. We will 
use instead the following lemma which seems to be well known; unfortunately, we 
do not know a quotable source or where it originally comes from (the first author 
learned it from David Singerman and Gareth Jones many years ago). 
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Lemma Every regular dessin Z with abelian automorphism group is a quotient of 
some regular dessin G, of type (r,r,r) on the Fermat curve F,. of exponent r, with 
automorphism group Aut Y, = C, x C,. 


Proof (See also [11], Example 5.5 and Exercise 5.8) If we describe F,. by its 
affine model x” + y” = 1, it is easy to see that B(x, y) := x’ defines a normal 
Belyi function corresponding to the normal inclusion A’ < A of the commutator 
subgroup in the triangle group A = A(r,r,r). This commutator subgroup is the 
surface group of F’, and for the corresponding dessin Y, we have 


AutQ, = A/A’ = C,xC, 


(for r > 3 itis an index 6 normal subgroup of the full automorphism group Aut F, ). 
Now suppose that X is quasiplatonic with a regular dessin of type (/,m,n) and 
abelian automorphism group A. Let r be the lcm of /, m,n. The surface group N 
of X is then the (torsion free) kernel of a group epimorphism of the triangle group 
A(I,m,n) onto A , inducing as well an epimorphism 


Az=A(r,r,r) > A 


with some kernel J” <1 A which is in general not torsion free. We have however 
X = I’\H, and because A & A/T is abelian, we get A’ < I with a quotient 
T/A’ =: U <a C, x C,, whence 


XZU\F, and GJSU\G,. 


oO 


Proof of Theorem 2.1 With the same notations and the same affine model for F’, as 
above we have now X X U\F,.. Every automorphism a € U acts on F, by 


a: (x,y) (th, ty) 


with a primitive r-th root of unity ¢, and exponents k,h € Z/rZ. So, F, and its Belyi 
function 6 are defined over Q and Aut F’, is defined over Q(¢,) . Moreover, the 
Belyi function B on X can be considered as induced by the Belyi function 6 on the 
U-orbits in F’, (well defined because 6 has the same value for all arguments in every 
U-orbit). Clearly, X° = U°\F, for all o € GalQ/Q where o acts in the obvious 
way on the automorphisms a € U, hence B’ is induced by f on the U°-orbits. 
However, since every Galois automorphism o € GalQ/Q acts on ¢, as fH ¢ - 
for some s € Z/rZ only depending on o , we have a = a and a € U for all 
a €U,hence U° = U forallo. Because J,, that is the pair (F,, 8) is defined 
over Q, also the quotient Y = U\G, is invariant under Gal Q/Q. Therefore its 
moduli field is Q,, and by Proposition 2.5 of [3] (see point (9) of Sect. 1) Z can be 
defined over Q. oO 
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3 Examples and Generalisations 


Quasiplatonic surfaces having dessins with abelian automorphism groups exist 
in all genera g > 1, compare the Accola—Maclachlan curves described in [11, 
Exercise 5.20]: for n = 2g + 1 the automorphism group A ~ Cy, is an image 
of the triangle group A(2,n,2n) under an epimorphism with torsion free kernel I” 
such that the quotient curve X ~ [”\H has an affine model 


y = y"-] 


In fact, there are much more examples. Among the 22 isomorphism classes of 
quasiplatonic curves in the genera g = 2,3,4 there are only one curve in genus 3 
and three curves in genus 4 not having a regular dessin with abelian automorphism 
group (they are also defined over Q, but by other reasons, compare [2]), see the 
Tables 5.2—5.4 in [11]. 

Recall that a curve Y of genus > 1 has only finitely many automorphisms. If Y 
is defined over a number field K , its automorphisms can therefore also be defined 
over a fixed number field L > K: by the action of field automorphisms of C on 
Aut Y fixing K elementwise we would otherwise get an infinity of automorphisms. 
A similar argument shows that a minimally chosen L is a normal extension of K. 
Recall also that Y = I"\H for a torsion-free normal subgroup 7 < A of the 
triangle group A which is the normaliser of J’ in PSL,R. 


Theorem 3.1 Let Y be a quasiplatonic curve, B its canonical Belyi function Y > 
Aut Y\Y, let K be the moduli field ( = minimal field of definition) of (Y, B), and let 
the Galois extension L > K be a field of definition for (Y, Aut Y). 


(1) For any subgroup U < AutY, the curve X := U\Y and the Belyi function B 
induced by B on X can be defined over L. 
(2) Its relative moduli field Mx := Mx(X, B), defined as the fixed field of 


Vix.ay/K := {0 €GalQ/K | Af, :X > X° isomorphism with B® of, = By, 
is the same as the fixed field of the group 
Gu := {o €GalL/K | U° is conjugate to U in Aut Y} 
Proof 1. Can be proved similarly to Corollary 1 of [6] considering the intermediate 


fields in the function field extension L(Y)/L(B) whose Galois group! is 
antiisomorphic to Aut Y, compare [11, Theorem 4.5]. 


'Girondo et al. [6] contains a little mistake: since f is used in the base field of the Galois extension 
of function fields, we have to be sure that 6 is also defined over the field of constants; this is 
guaranteed here by the construction of K < L. In Corollary 1 of [6] this may fail if the hypotheses 
of Lemma | or Lemma 2 of [6] are not satisfied. There one can fill the gap in the hypothesis by 
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2: 


Q 


First we have to prove that if there is an a € Aut Y with the property aUa~! = 
U° , we get an isomorphism 


f : U\Y=X > X°=U°\Y with B= B of 


which is in fact induced by Y > Y : y & a(y). Second we have to show 
that for any o € GalL/K with the property (X°,B°) & (X,B) there is such 
an automorphism a € Aut Y inducing this isomorphism via the conjugation of 
U, hence o € Gy. Since o fixes Y and £, both subgroups U and U® lift to 
Fuchsian groups ®, ®° between I” and A. The isomorphism f : X — X® lifts 
to some a@ € PSL, R acting on H and conjugating ® into ®° since B = B® of . 
Moreover, a normalises J”, hence belongs to A and induces an automorphism 
ac AuyY. 

oO 


If U < AutY is a normal subgroup, X is itself quasiplatonic. If moreover K = 
, then M is the minimal field of definition of (X, B). 
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The program of the Wednesday colloquium in memory of Wolfgang Schwarz 
during the €C AZ Conference 2014 in Hildesheim: 


Domane Marienburg 


ELAZ 2014 


Elementare und Analytische Zahlentheorie 


Colloquium in memory of 
Wolfgang Schwarz 
1934 — 2013 


Aula (Hohes Haus) 


9.00 Jurgen Sander: Address of Welcome 

9.05 Lutz Lucht: Remembering Wolfgang Schwarz - his life 
and work 

9.50 Karl-Heinz Indlekofer: Arithmetische Funktionen: ein 
Schwerpunktthema im wissenschaftlichen Werk von 
Wolfgang Schwarz 


Steinscheune 
10.40 Coffee break 


Ackerpferdestall 
11.10 Eva Schwarz (Violine), Valentin Blomer (Klavier) 
W. Schwarz, Romonze fir Violine und Ktavier 


L. v. Beethoven, Sonate G-Dur op. 31/1 
Allegro vivace 
Adagio grazioso 
Rondo - Allegretto 


A. Dvorak, Romantische Sticke op. 75, Nr. 1 und 3 
Allegro moderato 
Allegro appassionato 


la_(H H: 


11.45 Jérn Steuding: On Liouville Numbers — Yet Another 
Application of Functional Analysis in Number Theory 


Steinscheune 
12.30 Lunch 


Besides some biographical and scientific lectures the audience was listening to a 
concert of classical music; see the audience eagerly waiting for the beginning. 


Published with kind permission of © Isa Lange, 2014 
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Published with kind permission of © Isa Lange, 2014 


Wolfgang Schwarz’s daughter Eva (violin) and Valentin Blomer (piano) were 
playing some pieces composed by Beethoven, Dvorak, and Schwarz himself. 


Bay \N 


Published with kind permission of © Isa Lange, 2014 


The conference photography below was taken during this special event at the 
Doméine Marienburg of the University of Hildesheim. 


= ft ee eek 
Published with kind permission of © Isa Lange, 2014 
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